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Abstract 

This report documents work performed for NASA Grant NAG5-2006 for the period July 1, 
1993 through June 30, 1994. During this period, significant developments to the Communication 
Link and Error ANalysis (CLEAN) simulator were completed. Many of these were reported in 
the Semi-Annual report dated December 1993 which has been included in this report in 
Appendix A. Since December 1993, a number of additional modules have been added involving 
Unit-Memory Convolutional codes (UMC). These are: 

1 ) Unit-Memory Convolutional Encoder module (UMCEncd) 

2) Hard decision Unit-Memory Convolutional Decoder using the Viterbi decoding 
algorithm (VitUMC) 

3) A number of utility modules designed to investigate the performance of UMC’s such 
as 

a) UMC column distance function (UMCdc) 

b) UMC free distance function (UMCdfree) 

c) UMC row distance function (UMCdr) 

d) UMC Transformation (UMCTrans) 

The study of UMC’s was driven, in part, by the desire to investigate high-rate convolutional 
codes which are better suited as inner codes for a concatenated coding scheme. A number of 
high-rate UMC’s were found which are good candidates for inner codes. 

Besides the further developments of the simulation, a study was performed to construct a 
table of the best known Unit-Memory Convolutional codes. To date, a total of 100 new UMC’s 
were found which are better than any previously known UMC’s. These results have been 
submitted and accepted to the IEEE Transactions on Information Theory and will appear in print 
soon. A copy of the final paper is included in Appendix B for reference. 
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I. Introduction 


During the past year, CLEAN capabilities have grown substantially. Most of the new 
programs are briefly described in the semi-Annual report included in Appendix A. Among the 
developments is the integration of the RICE compression/decompression software into the 
simulation. Not included in that report are a number of modules involving encoding, decoding, 
and analysis of Unit-Memory convolutional codes. 



Appendix A 


Semi-Annual Report for the period July 1, 1993 through December 30, 1993 



The Communication Link and Error ANalysis (CLEAN) 

Simulator 


NASA GRANT NAG5-2006 
July 1, 1993 - June 30, 1994 


Semi-Annual Report 
July 1, 1993 - December 30, 1993 


Submitted to: 


Mr. Warner Miller 
Code 728.4 

Instrument Electronic Systems Branch 
Engineering Directorate 
NASA/Goddard Space Flight Center 
Greenbelt, MD 20771 
301-286-8183 


Submitted by: 

William J. Ebel, Ph.D. 
Frank M. Ingels, Ph.D. 
Shane Crowe 

Mississippi State University 
Drawer EE 

Mississippi State, MS 39762 
601-325-3912 


December 1993 



Table of Contents 


Abstract i 

I. Introduction 1 

II. Further Developments to Clean 7 

A. Soft Decision Program Modules 7 

1. iidsoft 7 

2. bstysoft 8 

3. displsft 8 

4. harden 9 

5. soften 9 

6. dpcisoft 9 

7. vitsoft 10 

8. vit3sync 1 1 

B. Markov Chain Program Modules 11 

1. markov 12 

2. markup 12 

3. markdown 12 

4. markjoin 13 

5. displmrk 13 

6. deintmrk 13 

C. RICE Program Modules 14 

1. ricecomp 14 

2. ricedcmp 14 

3. img2seq 14 

4. seq2img 14 

D. Miscellaneous Program Modules 14 

1. convened 15 

2. delete 15 

3. insert 15 

4. help 15 

5. madd 16 

6. pnseq 17 

7. trellis 17 

8. genhdf 18 

III. Periodic Convolutional Interleaver 20 

BIBLIOGRAPHY 44 



1 


Abstract 

This report documents work performed for NASA Grant NAG5-2006 for the period July 1, 
1993 through December 30, 1993. During this period, significant developments to the 
Communication Link and Error ANalysis (CLEAN) simulator were completed and include: 

1 ) Soft decision Viterbi decoding 

2) Node synchronization for the Soft decision Viterbi decoder 

3) Insertion/deletion error programs 

4) Convolutional Encoder 

5) Programs to investigate new convolutional codes 

6) Pseudo-Noise sequence generator 

7) Soft decision data generator 

8) RICE compression/decompression (integration of RICE code generated by Pen-Shu 
Yeh at Goddard Space Flight Center) 

9) Markov Chain channel modeling 

10) % complete indicator when a program is executed 

11) Header documentation 

12) Help utility 

The CLEAN simulation tool is now capable of simulating a very wide variety of satellite 
communication links including the TDRSS downlink with RFI. The RICE 
compression/decompression schemes allow studies to be performed on error effects on RICE 
decompressed data. The Markov Chain modeling programs allow channels with memory to be 
simulated. Memory results from filtering, forward error correction encoding/decoding, 
differential encoding/decoding, channel RFI, non-linear transponders and from many other 
satellite system processes. 

Besides the development of the simulation, a study was performed to determine whether 
the PCI provides a performance improvement for the TDRSS downlink. There exist RFI with 
several duty cycles for the TDRSS downlink. We conclude that the PCI does not improve 
performance for any of these interferers except possibly one which occurs for the TDRS East. 
Therefore, the usefulness of the PCI is a function of the time spent transmitting data to the 
WSGT through the TDRS East transponder. 
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I. Introduction 

During the past 6 months, CLEAN capabilities have grown substantially. Most of the new 
programs are briefly described in Section II. Among the developments is the integration of the 
RICE compression/decompression software into the simulation. In the Appendix, the theory of 
RICE compression is described along with a description of CLEAN implementation. In Section 
III, some results on the question of whether the PCI is really necessary for the TDRSS downlink 
is discussed. 

To help run the source code, the following list in given which provides a quick overview of 
the required input files and the output files which are associated with each program of CLEAN. 


filename(s).pdf > 


id.prm > 

binerrs.prm > 


id.prm 

bingap.prm 


id.prm > 

blkdecodprm > 

seq.err > 


id.prm > 

blkdeint.prm > 

seq.err > 


id.prm 

brsterrs.prm 


id.prm 

bstyerTS.prm 


> 

-> 


ascii.exe 


binerrs.exe 


bingap.exe | 


blkdecod.exe 1 


blkdeint.exe 


brsterrs.exe j 


bstyerrs.exe 


bstysoft.exe 


■> specified.file 


> binerrs.ID 
■> seq.err 


■> bingap.ID 
■> bingap.pdf 


-> blkdecod.ID 
> seq.err 


> blkdeint.ID 

> seq.err 


> brsterrs.ID 
■> seq.err 


> bstyerrs.ID 

> seq.err 


id.prm 

bstyerrs.prm 


> softmap.dat 
•> bstysoft.ID 
•> seq.sft 
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id.prm > 

seq.(err,sft,mrk) #1,#2 -> 



> compseq.ID 


id.prm > 

cvmblk.prm > 

seq.err > 



cvmblk.exe 


> cvmblk.ID 



id.prm > 


seq.err — 


— > 


cvmseq.exe 


> cvmseq.ID 


id.prm > 

blkdeint.prm > 

seq.mrk > 



> deintmrk.ID 

> seq.mrk 


id.prm 

deltaest.prm 
seq.err 


deltaest.exe 


■> deltaestID 


i<Lprm 

seq.err 


displerr.exe 


> displerr.ID 

> seq.err screen dump 


icLprm - 
seq.mrk 


di^lmik^exe| 


> displmricID 
■> seq.mrk screen dump 


id.prm > 

interactive inputs > 

seq.(err,sft,mrk) > 


1 

displseg.exe R 


■> displseg.ID 
•> segment screen dump 


id.prm > 

interactive inputs > 

seq.(err,sft,mrk) > 


displseq.exe 


> dispiseq.ID 

> sequence screendump 


id.prm ' 

seq.sft > 


displsft.exe 


■> displsft.ID 
■> seq.sft screen dump 













■> dpci.ID 
■> seq.err 
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id.prm > 

dpci.prm > 

seq.err > 


dpci.exe 



.. 



id.prm > 

dpci.prm > 

seq.sft > 



dpcisoft.exe 


■> dpcisoft.ID 
> seq.sft 


id.prm > 

WTFF error file > 


eosconv.exe 


> eosconv.ID 

> seq.err 


WTFF error file > 


eoshex.exe 


— > file screen dump 


id.prm 

gapest.prm - 
seq.err 


interactive inputs > 


id.prm > 

interactive inputs > 



genhdf.exe 



■> gapestID 


■> genhdf.ID 


> genmap.ID 


id.prm 

seq.sft 


harden.exe | > harden.ID 

I > seq.err 


help.exe 


> help screen dump 


> 

-> 


iidsoft.exe 



id.prm 

binerrs.prm 


> iidsoft.ID 

> seq.sft 
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id.prm > 

interactive inputs > 


img2seq.exe 


> img2seq.ID 

> user file 


id.prm > 

interactive inputs > 


intvbin.exe 


> intvbin.ID 

> intvbin.pdf 


id.prm > 

seq.err > 


intvpdf.exe 






intvpdf.ID 

interval.pdf 


id.prm > 

joinseq.prm > 

seq.err > 


joinseq.exe 


> joinseq.ID 

> seq.err 


id.prm > 

interactive inputs > 


maddexe 


> madd.ID 


icLprm > 

mafilt.prm > 

seq.err > 


mafilt.exe I > mafilt.ID 


id.prm 

markdown.prm 
seq.mrk 


> 


markdown.exe | > markdown. ID 

I > seq.err 


id.prm 

markjoin.prm 
seq.mrk 


markjoin.exe | > markjoin.ID 


id.prm 

markov.prm 


markov.exe 


— > markov.ID 
— > seq.mrk 


— > 

> 

— > 



icLprm 

markup, prm — 
seq.mrk 


-> markup. ID 
-> seq.err 













id.prm 



> nrzmdec.ID 

> seq.err 


> nrzmencd.ID 

> seq.err 


> pnseq.ID 

> sequence.pn 


> quantpdf.ID 

> user pdf file 


> header.(extension) 


rawhdr.exe 


-> rawhdr.ID 


ncecomp.exe 


-> ricecomp.ID 


ricedcmp.exe 


-> ricedcmp.ID 



— > seq2img.ID 
> user image file 


errseq.arc 













id.prm 

interactive inputs 


> 

-> 


-> seqtrunc.ID 
-> truncated seq 



id.prm > 

interactive inputs > 


sequnarc.exe 


> sequnarc.ID 

> seq. err 


id.prm > 

interactive inputs > 



seterrs.exe 



> seterrs.ID 

> seq.err 


id.prm 

cirri f* nrm 

> 

byiiL.pnii — — 

seq.err 



sync.exe 


> sync. ID 


interactive inputs > 


syncpb.exe 


■> syncpb.log 


interactive inputs > 


syncppn.exe II > syncppn.log 


id.prm > 

interactive inputs > 


trellis.exe 


> trellis.ID 

> trellis.plt 


id.prm 

viterbi.prm — 
seq.err 


— > 
— > 


vithard.exe | > vithard.ID 

II > seq.err 


id.prm 

viterbi.prm - 
seq.err 


vitmark.exe 


> vitmark.ID 

> seq.err 



> 



id.prm 

viterbi.prm 
seq.sft 


— > vitsoft.ID 
> seq.err 
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II. Further Developments to Clean 

This section briefly describes additional capabilities which have been added to CLEAN. 
The capabilities have been divided into two main sections. In Section A, additional error 
sequence mani pulation programs, which represent system components, are briefly described and 
in Section B, programs written to evaluate theoretical formulas are briefly described. 

A. Soft Decision Program Modules 

To more accurately reflect the receiver, programs were written to simulate soft decision 
values which are output by the demodulator for the real TDRSS. These programs involve "soft" 
sequence generation programs as well as programs to mimic the receiver DPCI and Viterbi 
decoder on those soft values. 

1. iidsoft 

This program generates a "soft" error sequence with independent and identically 
distributed soft event occurrences. By definition, an ERROR sequence MUST refer to hard 
decision data at the demodulator output. In contrast, this program simulates 3-bit soft decision 
data which would be output by a soft decision demodulator, assuming that the signal transmitted 
corresponds to the transmission of a binary zero. The algorithm involves using the channel error 
probability, input by the user through the parameter file, to construct the conditional Normal 
densities for the random variable which would be input to a multilevel thresholder to determine 
the 3-bit soft decision output. For convenience, it is assumed that the received signals are 
identically +1,-1 for a binary 1,0 respectively and that the decision thresholds, used to construct 
the 3-bit soft decision data numbers, are located at equi-spaced distances around +1 and -1 
inclusive. Then, the 3-bit binary number assigned to each level begins with 000 for the range 
below -1 and end with 1 1 1 for the range above +1. In summary, the thresholds are arbitrarily 
chosen as follows: 


3-bit value 

Binary rep. 

Low Thresh. 

High 

7 

Ill 

infinity 

3/2 

6 

110 

3/2 

1 

5 

101 

2 

1/2 

4 

100 

1/2 

0 

3 

011 

0 

-1/2 

2 

010 

-1/2 

-l 

1 

001 

-1 

-3/2 

0 

000 

- 3/2 

- infinity 


Note that the first binary value to be output is the least significant digit for the soft value. 
These threshold values were taken from Heller and Jacobs, "Viterbi Decoding for Satellite and 
Space Communication," IEEE Transactions Communication Technology, vol. COM 19, no. 5, 
October 1971, pp. 835-848. 
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To determine the soft decision values for each signal output, the probability of occurrence 
for each level must be found and subsequently used to statistically determine the sequence 
output. The probability that the i(th) soft value occurs is stored in SoftProb(i) which can be 
found using the Q(x) function. As implemented below, the cumulative SoftProb is stored in 
SoftProb, that is, SoftProb(i) represents the probability that soft value i, or i-1, ..., or 0 occurs. 
This is done to optimize execution speed. It is possible to threshold the soft sequence with a 
threshold of 0 to perform hard decision demodulation. 

This program inputs parameters from an ASCII data file with default name ’BinErrs.prm’ 
and outputs the "soft" error sequence to data file with default name ’seq.sft’. In addition, various 
statistics are output to an ASCII data file with default name TIDSoft.ID\ where ID is a three 
letter identifier for the current run which is input from file ’ID.prm’. 

The program is run by editing the parameter file ’BinErrs.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. Executing the 
program generates the ’seq.sft’ file which contains a sequence (in packed format) with 
independent and identically distributed soft values. It does not matter whether the output file 
’seq.sft’ exists or not. If it exists, it is overwritten without a prompt to the user. 


2. bstysoft 

This program generates a "soft" error sequence with bursty errors. The method for 
generating the soft values is discussed in the previous section for the iidsoft program 
documentation. The application here is identical except that two SoftProb functions are required: 
one when a burst is occurring and one when no burst is occurring. 

A discussion of the method by which the burst length and burst interval statistics are 
generated can be found in the documentation of program bstyerrs.for. 

This program inputs parameters from an ASCII data file with default name ’BstyErrs.prm’ 
and outputs the soft sequence to a data file with default name ’seq.sft\ In addition, various 
statistics are output to an ASCII data file with default name ’BstySoftlD’, where ID is a three 
letter identifier for the current run which is input from file ’ID.prm’. 

The program is run by editing the parameter file ’BstyEirs.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. Executing the 
program generates the ’seq.sft’ file which contains a "soft" error sequence (in packed format). It 
does not matter whether the file ’seq.sft’ exists or not. If it exists, it is overwritten without a 
prompt to the user. 

Even though Poisson distributed bursts may overlap in theory, this program does not allow 
bursts to overlap. The user must take care to specify input parameters so that the probability of 
overlapping burst is negligible. 


3. displsft 

This program displays the soft sequence found in file ’seq.sft’. It is assumed that the 3 -bit 
soft values stored in seq.sft are in the SSPS (Soft Sequence Packed Symbol) format. 
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4. harden 

This program reads in 3 -bit soft decision data and performs hard decision thresholding. 
This will effectively reduce the length of the sequence file by a factor of 3. 

The program reads in the soft sequence by blocks and performs hard decision thresholding 
on each block and then writes the modified block back out to the ’seq.err’ file. The program 
outputs several statistics to the user screen as well. Note that the sequence is read in from file 
’seq.sft’ (SeqType=2), with 3-bit soft data and is stored in file ’seq.err’ (SeqType=l), with hard 
errors. 

Executing the program causes the ’seq.sft’ file to be read which contains a soft value 
sequence (in packed format). The ’seq.sft’ file must exist prior to the execution of this program. 


5. soften 

This program maps binary data into soft values out of the soft decision demodulator. The 
method used to perform this mapping is to combine the data sequence with an already existing 
soft sequence. Consider a particular data bit and the corresponding soft value from the soft 
sequence. If the data bit is a zero, then the soft value which would occur at the demodulator 
output remains the same. However, if the data bit is a 1, then the soft value which would occur 
at the demodulator output is the bit complement of the corresponding soft value. The bit 
complement can be achieved by taking 8 and subtracting the base 10 equivalent of the soft value. 
For a discussion of how soft values are generated at the demodulator output, see Section 1 above, 

This program inputs the data from file ’seq.dat’ and the soft sequence from file ’seq.sft’ 
and stores the result in the ’seq.sft’ file. In addition, various statistics are output to an ASCII 
data file with default name ’Soften.ID’, where ID is a three letter identifier for the current run 
which is input from file ’ID.prm’. 

Executing the program causes the ’seq.sft’ file to be modified. Before running this 
program, sequences ’seq.dat’ and ’seq.sft’ must exist. 


6. dpcisoft 

This program performs Periodic Convolutional Deinterleaving of the soft sequence found 
in file ’seq.sft’. It is assumed that the channel symbols corresponding to those values have 
already been interleaved using an (Ntaps,M) periodic convolution interleaver. The method used 
to implement the function of the periodic convolutional interleaver is a series of formulas as 
described below. These functions are applied to a portion of the ’seq.sft’ array which is stored in 
a ring buffer. 

The method used to implement the deinterleaver involves constructing a Tap offset array 
which gives the offset for the soft sequence index to deinterleave next, based upon the tap 
position of the deinterleaver commutator. The Cycle offset is then used to determine the offset 
for the current commutator cycle number which is also used to determine the soft sequence index 
to de interleave. 

Note that there is a problem deinterleaving the end of the ’seq.sft’ file due to the sequential 
nature of the algorithm. The DPCI soft sequence file is truncated to eliminate the "don’t cares". 
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This program inputs parameters from an ASCII data file with default name ’DPCI.prm’ 
and outputs the soft sequence to data file with default name ’seq.sft’. In addition, various 
statistics are output to an ASCII data file with default name ’DPCISofUD’, where ID is a three 
letter identifier for the current run which is input from file TD.prm’. 

The program is run by editing the parameter file ’DPCI.prm’ and selecting the appropriate 
parameters and by choosing a program ID by editing file TD.prm’. Executing the program 
generates the ’seq.sft’ file which contains an soft sequence (in packed format) with deinterleaved 
values. The ’seq.sft’ file must exist prior to the execution of this program. 


7. vitsoft 

This program performs soft decision Viterbi decoding assuming ANY data sequence is 
transmitted. The Viterbi decoding algorithm assumes that the trellis begins at the all zero state 
for the first received code symbol. The end of the decoding process does not terminate with 
flush bits. Instead, steady state Viterbi decoding is performed up to the end of the data seq. 

This program inputs parameters from an ASCII data file with default name ’Viterbi.prm’ 
and inputs the soft sequence from file ’seq.sft’ and outputs the decoded data sequence to data file 
with default name ’seq.err’. In addition, various statistics are output to an ASCII data file with 
default name ’VitSoft.ID’, where ID is a three letter identifier for the current run which is input 
from file TD.prm’. 

The program is run by editing the parameter file ’Viterbi.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. Executing the 
program generates the ’seq.err’ file which contains an error sequence (in packed format) with the 
decoded error sequence. The ’seq.sft’ file must exist prior to the execution of this program. 

There are several assumptions associated with the implementation and output of this 
program. 

1) The path with the maximum probability metric at the i(th) Trellis stage is used to find the 

decoded bit for the output 

2) It is assumed that the convolutional encoder is either rate 1/2 or rate 1/3. It is straight 

forward to extrapolate this program to accommodate a rate 1/n encoder. It should also be 
possible to modify this program to accommodate a rate m/n encoder. 

The Viterbi algorithm, as implemented here, updates the Trellis by iterating through each 
of the states at the next stage. The probability metric for each path entering a given state are 
computed and the survivor is kept while the other sequence is discarded. In case of a tie, a coin 
is flipped (via a Uniform RV in [0,1]) to determine the survivor. The survivor is identified by 
updating the MLStateTrace array. This array contains the state of the previous Trellis stage 
which connects to the given state being processed. For example, suppose that we are now 
processing the next stage in the Trellis, we first consider state 1 at the next stage. After 
investigating the probability metric for the two possible paths entering state 1, we find that the 
survivor path came from state 3 of the previous Trellis stage. Therefore, MLStateTrace(i,l) — 3 
where i is the stage index. 
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To prevent overwriting the Metric array, two Metric arrays are alternately processed for 
each Trellis stage. This is why the algorithm performs two Trellis stage updates for each main 
loop. In the first Trellis stage update, the metrics are found in array MetricA and the new metrics 
are stored in MetricB. In the second Trellis stage update, the metrics are found in array MetricB 
and the new metrics are stored in MetricA. 

The Trellis is defined via three arrays; PathCodeSym, PathLink, and PathBit. Since this 
program only accommodates rate 1/2 or 1/3 encoders, only two paths enter each state at a given 
trellis stage. Therefore, if there are N trellis states, then there are only 2*N possible paths 
between two trellis stages. These are sequentially numbered from 1 to 2*N where path number 1 
and 2 enter state 1, path 3 and 4 enter state 2, etc. Array PathLink(i) gives the state number from 
which path i originates. Also, PathCodeSym(i) gives the code symbol associated with path i, and 
PathBit(i) gives the bit associated with path i. Taken together, these three arrays completely 
define the steady state trellis. 


8. vit3sync 

This program operates like the vitsoft program. However, this program mimics exactly 
what happens in the real LV7017C hardware which is documented in an interoffice 
Memorandum written by James Wang and Wei-Chung Peng of LinCom with subject, 

"Simulation and Validation of Viterbi Decoder", TM-8719-05-09 and TM-8707-06, 01 March 
1989. The vitsoft modifications performed to construct this program are as follows. 

1) The metrics which are accumulated are arbitrarily chosen as described in an interoffice 
Memorandum mentioned above. This program mimics exactly what occurs in the real LV7017C 
hardware. 

2) The metrics are monitored to determine whether node synchronization is lost. If node 
synchronization is lost, then the alternate bit pairings of the received data is chosen in an attempt 
to resync. The metrics are monitored again to determine whether synchronization has been 
established. 

This program inputs parameters from an ASCII data file with default name ’Viterbi.prm’ 
and inputs the soft sequence from file ’seq.sft’ and outputs the decoded data sequence to data file 
with default name ’seq.err’. In addition, various statistics are output to an ASCII data file with 
default name ’Vit3Sync.ID\ where ID is a three letter identifier for the current run which is input 
from file TD.prm’. 

The program is run by editing the parameter file ’Viterbi.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. Executing die 
program generates the ’seq.err’ file which contains an error sequence (in packed format) with the 
decoded error sequence. The ’seq.sft’ file must exist prior to the execution of this program. 


B. Markov Chain Program Modules 

Most processes which are used to manipulate and communicate binary data from a source 
to an end user can be modelled accurately by a Markov Chain. This includes differential coding, 
error correction coding, filtering, non-linearities, and more. In short, it should be possible to 
model the TDRSS downlink using a Markov Chain with an appropriate number of states. It is 
only necessary to determine the number of states and the transition probabilities. Estimating the 
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transition probabilities can be accomplished using the Baum-Welch algorithm [4]. Although the 
Baum-Welch algorithm has not been implemented in the simulation, programs which involve 
Markov Chains have been incorporated into the simulation to meet this goal. These are 
described below. 

1. markov 

This program generates a sample state sequence which is representative of a Markov Chain 
with known transition probability matrix. Each state is assigned a number from 0 to N-l where 
N is the number of states. 

This program inputs parameters from an ASCII data file with default name ’Markov.prm’ 
and outputs a state sequence with default name ’seq.mrk’. In addition, various statistics are 
output to an ASCII data file with default name ’Markov.ID’, where ID is a three letter identifier 
for the current run which is input from file TD.prm’. 

The program is run by editing the parameter file ’Markov.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file TD.prm’. Executing the 
program generates the ’seq.mrk’ file which contains the state sequence It does not matter 
whether the output file ’seq.mrk’ exists or not. If it exists, it is overwritten without a prompt to 
the user. 


2. markup 

This program reads in a state sequence and performs hard decision thresholding for the 
upper bound case. 

The program reads in the state sequence by blocks and performs hard decision thresholding 
on each block and then writes the modified block out to file ’seq.err’. The program outputs 
several statistics to the user screen as well. Note that the sequence is read in from file ’seq.mrk’ 
(SeqType=5), with Markov Chain states and is stored in file ’seq.err’ (SeqType=l), with hard 
errors. 

Executing the program causes the ’seq.mrk’ file to be read which contains a state 
sequence. The ’seq.mrk’ file must exist prior to the execution of this program. 


3. markdown 

This program reads in a state sequence and performs hard decision thresholding for the 
lower bound case. 

The program reads in the state sequence by blocks and performs hard decision thresholding 
on each block and then writes the modified block out to file ’seq.err’. The program outputs 
several statistics to the user screen as well. Note that the sequence is read in from file ’seq.mrk’ 
(SeqType=5), with Markov Chain states and is stored in file ’seq.err’ (SeqType=l), with hard 
errors. 

Executing the program causes the ’seq.mrk’ file to be read which contains a state 
sequence. The ’seq.mrk’ file must exist prior to the execution of this program. 
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4. markjoin 

This program generates the joint event probabilities for joints events associated with 
received codewords in the state seq. It is assumed that each state in the received sequence 
corresponds to a code symbol. The algorithm involves partitioning the state sequence into 
n-state blocks, where n is the code blocklength, called the received codeword state. The number 
of each state which occurs within a received codeword state constitutes a single sample point for 
the joint state event. The number of each joint event is accumulated and the total for each is 
divided by the number of received codeword states to determine the empirical probability. The 
only problem with this procedure is defining an efficient method for identifying each joint event. 
The method used in this program is to define an array, Joint(i), in which all joint events would be 
stored in a unique location. If the Markov Chain has S states, then there are [(n+S+1) choose 
(n]> number of ways that a specific number of each state occurs in the received codeword state. 

If a received codeword state has nl, n 2, ..., nS number of occurrences of states si, s2, ..., sS, 
respectively, then the Joint array location which contains this joint event is computed on the fly 
as given in subroutine Statelndex. 

This program inputs parameters from an ASCII data file with default name ’MarkJoin.prm’ 
and inputs the state sequence from data file with default name ’seq.mrk’ and outputs the joint 
probabilities to ASCII data file with default name ’MarkJoint.ID’, where ID is a three letter 
identifier for the current run which is input from file TD.prm’. 

The program is run by editing the parameter file ’MarkJoin.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. The ’seq.mrk’ 
file must exist prior to the execution of this program. 


5. displmrk 

This program displays the sequence found in the file ’seq.mrk\ This file must be of type 
Seq Type = 5 (state). 


6. deintmrk 

This program performs block deinterleaving of the state sequence found in file ’seq.mrk\ 
It is assumed that the channel symbols corresponding to those states have already been 
interleaved using an (C,R,m) block interleaver. The deinterleaver groups every m state seq 
values together and deinterleaves them as a group. The method used to implement the function 
of the block interleaver is to read in a block of the state seq and to use a series of formulas to 
perform the block deinterleaving. These formulas are described in the blkdeint program [5] 

This program inputs parameters from an ASCII data file with default name 
’DeintMrk.prm’ and outputs the state sequence to data file with default name ’seq.mrk’. In 
addition, various statistics are output to an ASCII data file with default name ’DeintMrk.ID’, 
where ID is a three letter identifier for the current run which is input from file ’ID.prm’. 

The program is run by editing the parameter file ’DeintMrk.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. Executing the 
program generates the ’seq.mrk’ file which contains an state sequence with deinterleaved states. 
The ’seq.mrk’ file must exist prior to the execution of this program. 
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C. RICE Program Modules 

In an effort to investigate the interaction between RICE decompression and errors which 
may result from decoding failure, several programs were written to perform RICE 
compression/decompression and convert image sequences to/from the sequence format required 
by CLEAN. These are described here. 

1. ricecomp 

This is the same code received from Pen-Shu Yeh at Goddard Space Flight Center with 
slight modifications to work with CLEAN. The code reads in an image in JPL format and 
compresses it into a format defined by Pen-Shu Yeh. 


2. ricedcmp 

This is the same code received from Pen-Shu Yeh at Goddard Space Flight Center with 
slight modifications to work with CLEAN. The code reads in an image in JPL format and 
compresses it into a format defined by Pen-Shu Yeh. 


3. img2seq 

This program converts the Jet Propulsion Laboratory’s image file format (ASCII) to the 
CLEAN code data file format (packed). Both, the .img and .seq, filenames are specified by the 
user on the command line. This program works for RICE-compressed or uncompressed files. 

First the program reads the image header and writes it to the sequence file’s header. The 
program determines whether or not the file is compressed by reading character* 2 chi in the 
image header. Then the appropriate conversion routine is selected and executed. 


4. seq2img 

This program converts a sequence file to an image file in the Jet Propulsion Laboratory’s 
format. The sequence file must contain the proper image header data in the sequence header so 
that the image file will be constructed correctly. 

If chi character in the image header is ’Cl ’ the image will be written in the compressed 
image format. If chi is ’UO’ the image file will be written in the non-compressed format. If chi 
is neither of these, the program will end. 

Portions of this code are adapted from JPL’s source code. 


D. Miscellaneous Program Modules 

Several additional programs were developed to accommodate convolutional encoding, 
cycle slips in the demodulator which can cause insertion errors and deletion errors, as well as 
other programs described below. 



15 


1. convened 

This program performs convolutional encoding on a binary data sequence. The data is 
read in from file with default name ’seq.dat’ and the output is stored in a file with default name 
’codeseq.dat’. The encoder structure information is found in parameter file ’Viterbi.prm’ (see 
vithard for a description of these parameters). 

Executing the program causes the file ’codeseq.dat’ to be created or modified. Before 
running this program, sequence ’seq.dat’ must exist. 

There are no assumptions associated with the implementation or output of this program. 


2. delete 

This program simply deletes user specified soft values from a soft sequence. This process 
mimics bit deletions in the channel due to receiver PLL cycle slips. This program only works 
with soft decision sequences. 

This program inputs the soft values to be deleted from data file with default name 
’delete.dat’ and applies those deletions to sequence found in file ’seq.sft’. In addition, various 
statistics are output to an ASCII data file with default name ’Delete.ID’, where ID is a three 
letter identifier for the current run which is input from file ’ID.prm’. 

Executing the program causes the ’seq.sft’ file to be modified. Before running this 
program, data file ’delete.dat’ and sequence ’seq.sft’ must exist. 


3. insert 

This program simply inserts user specified soft values into a soft sequence. This process 
mimics bit insertions in the channel due to receiver PLL cycle slips. This program only works 
with soft decision sequences. 

This program inputs the soft values to be inserted into the data file with default name 
’insert.dat’ and inserts those into the sequence found in file ’seq.sft’. In addition, various 
statistics are output to an ASCII data file with default name ’Insert.ID’, where ID is a three letter 
identifier for the current run which is input from file ’ID.prm’. 

Executing the program causes the ’seq.sft’ file to be modified. Before running this 
program, data file ’insert.dat’ and sequence ’seq.sft’ must exist. 


4. help 

This program simply puts help type information to the user screen concerning the usage of 
the multiple executable programs which make up the CLEAN simulator. The information shown 
on the user screen is as follows: 
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** * 
* * ★ 


Communication Link and Error ANalysis 
(CLEAN) 

A communication link simulation tool 
Developed for: 

NASA Goddard Space Flight Center 


* ** 
* ** 
* * * 
* * * 


Developed by: 

Mississippi State University 
William J. Ebel, Ph.D. 
Drawer EE 

Mississippi State, MS 39762 
601-325-3912 


* ** 


* * * 


*•* * 
*** 
** * 

* * * 




* Simulation description: 

This simulation tool consists of a collection of separate executable 
programs which perform various operations found in the TDRSS downlink 
receiver. The simulation is based upon sequences which are expected to occur 
at the receiver threshold device output (hard or soft decision) . Complex 
systems can be simulated by executing the appropriate programs, corresponding 
to the operations found at the receiver, in the proper order. 


SIMULATION EXECUTABLES 


=> EVENT GENERATORS <= 

BinErrs: Binomial error generator 

BrstErrs: Burst error generator 
BstyErrs: Bursty error generator 
SetErrs: User set error seq 

BstySof t : Bursty Soft generator 
IIDSoft: Indep. Ident. Distr. Soft 

Markov: Markov Chain State generator 

=> MARKOV CHAIN PROGRAMS <= 

MarkDown: Conv. to lower bound errors 
Markup: Conv. to upper bound errors 

Mark Join : Estimate joint event prob 

=> INTERLEAVERS <= 

BlkDeint: Block deinterleaver 
DeintMrk: Block Deint for M.C. States 
DPCI : Error seq PCI Deinterleaver 

DPCISof t : Soft seq PCI Deinterleaver 

=> ERROR CORRECTING DECODERS <=* 

BlkDecod: Block, Reed- Solomon decoder 
VitHard: Viterbi hard decision decode 

VitMark: Viterbi decode w/ Markov eat 

VitSoft: Viterbi soft decision decode 

=> SYNCHRONIZATION PROGRAMS <= 

Sync: Seq. err Sync stat. gen. 

SyncPb: Theoretical sync stat. gen. 

SyncPPN: Theoretical sync stat. gen. 

=> MISCELLANEOUS <= 

GenHDF: Gen. Hamming Distance Fnc. 

GenMap: Soft value mapping gen. 

PNseq: Pseudo -Noise sequence gen. 

RawHdr: Show raw header (for debug) 

Trellis: Trellis generator 


=> NRZM UTILITIES <= 

NRZMDec: NRZM decoder 

NRZMEncd : NRZM encoder 

---=> RICE COMPRESSION PROGRAMS <= 

RICEComp: RICE compression (Pen -Shu) 
RICEDcmp: RICE decompression (Pen -Shu) 
Img2Seq: Image to seq. err conv. 

Seq2Img: seq. err to Image conv. 

=> STATISTICS <= 


DeltaEst : 

GAPES t: 

BinGAP: 

IntvPDF: 

IntvBin: 

CVMBlk : 

CVMSeq: 


Ascii : 

Comp Seq: 

DisplErr : 

DisplSeg: 

DisplSeq: 

DisplSf t : 

DisplMrk: 

EOS conv: 

EOShex: 

Harden: 

JoinSeq: 

MAdd: 

MAFilt: 

QuantPDF: 

Query Seq: 

SeqArc: 

SeqTrunc: 

SeqUnarc: 


Delta burst stat. est. 

GAP method burst stat. est. 
Binomial theor. GAP distr. 
Empirical interval distr. 
Interval distr. for bin errs 
CVM bin test by block 
Error seq CVM bin test 

--=> UTILITIES <- 

convert a PDF file to ascii 
Compare sequences 
Displ seq. err to screen 
Display sequence segment 
Display sequence to screen 
Displ seq.sft to screen 
Displ seq.mrk to screen 
EOS data conversion 
EOS data display in HEX 
Hard threshold soft values 
Join two sequences 
Exclusive OR two error seq 
Moving Average filter of seq 
Quantize PDF 
Query sequence header 
EOS sequence archiver 
Seq length truncator 
Sequence unarchiver 


5. madd 

This program modulo adds two binary data sequences. Each file name is specified by the 
user throught the keyboard. Both sequence files should be in packed format. The results of the 
modulo addition are stored in second file in packed format. If the two files are different lengths, 
the extra length is truncated. The program also outputs the error sequence error density based on 
the assumption that a ’ V corresponds to an error. 
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This program was written with the intention to modulo add the channel input to the channel 
output to yield the channel error sequence. The error sequence is stored in file with name 
SeqFileName2. 

6. pnseq 

This program generates a psuedo-noise (PN) sequence. The implementation used here is 
that of Figure 8-6, pg. 380 of "Digital Communications and Spread Spectrum Systems" by 
Ziemer and Peterson. 

The input parameters (data sequence length, generator polynomial order, and random 
number generator seed) are specified in a file called ’pnseq.prm’. Only orders of 7, 10 17, 20, 
25, or 28 are allowed. Any orders other that these will cease program execution. The maximum 
length sequence for each generator polynomial order is listed in ’pnseq.prm’. 

The shift register in the PN sequence generator is initialized with random binary values. 

The data sequence is stored in packed form in ’seq.dat’ 


7. trellis 

This program generates and displays for the user the convolutional encoder trellis diagram 
along with useful parameters. The program also generates a plot file which contains line 
segments which will physically form the shape of the trellis. 

This program was derived from the Trellis generation subroutine constructed for the 
Viterbi decoding program. 

The Trellis is defined via three arrays; PathCodeSym, PathLink, and PathBit. Since this 
program only accommodates rate 1/2 or 1/3 encoders, only two paths enter each state at a given 
trellis stage. Therefore, if there are N trellis states, then there are only 2*N possible paths 
between two trellis stages. These are sequentially numbered from 1 to 2*N where path number 1 
and 2 enter state 1, path 3 and 4 enter state 2, etc. Array PathLink(i) gives the state number from 
which path i originates. Also, PathCodeSym(i) gives the code symbol associated with path i, and 
PathBit(i) gives the bit associated with path i. Taken together, these three arrays completely 
define the steady state trellis. 

This program inputs parameters from an ASCII data file with default name ’Viterbi.prm’ 
and outputs the trellis structure along with useful parameters to an ASCII data file with default 
name Trellis.ID’, where ID is a three letter identifier for the current run which is input from file 
TD.prm’. 

The program is run by editing the parameter file ’Viterbi.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. 
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8. genhdf 

This program performs convolutional encoding of a binary sequence for a single generator 
function. For now, the program will iterate through all possible generator function for a given 
constraint length, and generate the "Hamming weight sequence" function, which is the Hamming 
distance for all possible input sequences, for each generator function. 

The method used to construct the unique input sequences is described next. Valid input 
sequences are all those possible which do not have a string of K consecutive zeros in them where 
K is the constraint length of the code. These sequences can be generated as follows: 

1) Construct the following prefix code: 

C = { 1,01, 001, 0001,. ..,0 K2 1 } 

where 0 K ' 2 denotes K-2 consecutive zeros. This is a prefix code because no vector in the 
set can be constructed from a group of other vectors 

2) Construct the first sequence as the first prefix code vector 1. 

3) Construct all subsequent sequences as combinations of the prefix code vectors as follows: 

a) Number the prefix code vectors as follows: 

Number Prefix Code 

0 1 
1 01 

2 001 

i 0*1 

K-2 (f' 2 \ 

Note that there are K-l prefix code numbers. 

b) Now let an integer counter, j, iterate from 0 on up 

c) Consider the j(th) integer counter value. Suppose it has a base K-l representation 
j = jO * (K-l)° + jl * (K-l )‘ + j2 * (K-l) 2 + ... 

where each coefficient is a number in the range 0,l,...,K-2. Next construct the base 
K-l number by concatenating the coefficients together: 

j base 10 = [... j2 jl jO] base (K-l) 

Now construct the j(th) Generator Hamming Distance function sequence by starting 
the sequence with a 1 and by concatenating the prefix code sequence for the base K-l 
coefficients in the order from least significant to most significant. That is, the input 
sequence is constructed by 

sequence = ... (J2 PC) (j 1 PC) (jO PC) 1 
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where PC stands for Prefix Code. 

The only problem with this formulation is that it excludes input sequences of the form 1' for 
integer i greater than 2. However, only input sequences up to a given length (MaxSeqLength) 
are constructed. Therefore, the input sequences of the form V for i=l,...,MaxSeqLength are 
constructed first and placed at the beginning of the sequence. This completes the description of 
how the encoder input sequences are constructed. 

This program was derived from the Trellis generation program constructed for the Viterbi 
decoding program. 

The Trellis is defined via three arrays; PathCodeSym, PathLink, and PathBit. Since this 
program only accommodates rate 1/2 or 1/3 encoders, only two paths enter each state at a given 
trellis stage. Therefore, if there are N trellis states, then there are only 2*N possible paths 
between two trellis stages. These are sequentially numbered from 1 to 2*N where path number 1 
and 2 enter state 1, path 3 and 4 enter state 2, etc. Array PathLink(i) gives the state number from 
which path i originates. Also, PathCodeSym(i) gives the code symbol associated with path i, and 
PathBit(i) gives the bit associated with path i. Taken together, these three arrays completely 
define the steady state trellis. 

This program inputs parameters from an ASCII data file with default name ’Viterbi.prm’ 
and outputs the trellis structure along with useful parameters to an ASCII data file with default 
name ’GenHDF.ID’, where ID is a three letter identifier for the current run which is input from 
file TD.prm’. 

The program is run by editing the parameter file ’Viterbi.prm’ and selecting the 
appropriate parameters and by choosing a program ID by editing file ’ID.prm’. 
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III. Periodic Convolutional Interleaver 

Recently, the issue as to whether the PCI is necessary in TDRSS has surfaced. Two 
documents (presentation slides) have addressed this issue, one by Warner Miller [1] and one by 
Ted Kaplan and Ted Berman [2] which give conflicting results. Below, the main points and 
results of the documents are outlined and it is shown that the results are not comparable due to 
the fact that the channel models used are fundamentally different. 

In document [1], OMV test results are presented to illustrate why the PCI is not necessary 
for the TRMM communications link. Bursts of a fixed length were input into the Viterbi 
decoder, one at a time, both with and without the PCI present to determine the effect (number of 
erred bits output by the Viterbi decoder). The main points of the document are as follows. 

1 ) Viterbi output bursts are not extended. This is not entirely true. If a burst (in terms of 
code symbols) of length B is input to the Viterbi decoder, then generally a burst (in BITS) 
of length B+M is output where M is a number less than the memory length of the decoder 
(32 for the LV7017). However, for the length of bursts considered for the OMV tests 
(>50), the slight increase in burst length is not noticeable. 

2) A (255,223,16) Reed-Solomon code can correct 16 code symbols or (at least) 628 
consecutive bit errors. This is correct. 

3) The OMV tests show that without the PCI, almost all the error bursts output by the 
Viterbi decoder can be corrected. When the PCI is present, synchronization loss causes 
error bursts at the Viterbi decoder output of >1000 bits which cannot be corrected by the 
RS decoder. 

4) The OMV test results conflict with the CLASS analysis performed by Ted Kaplan. 

In document [2], CLASS (it is assumed) is used to generate performance results which 
show that the PCI is necessary. The noise environment is modeled by Poisson occurring RFI 
pulses which affect 15 code symbols (30 binary symbols) at the Viterbi decoder input. The duty 
cycle of the RFI is taken to be .018 and thermal noise and False Loss of Viterbi Decoder 
Synchronization (FLDS) are ignored. The main points of the document, for the no PCI case, are 
as follows. 

1) Without the PCI, the Viterbi decoder can’t correct code symbol bursts of length 15. In 
fact, it is stated that the bursts at the Viterbi decoder output are longer than those at the 
input. See item (1) above. It is my belief that in principle. Document [1] agrees with this 
assessment. 

2) With the PCI, the errors at the Viterbi decoder output are not present unless PCI 
synchronization is lost. In essence, the error probability at the Viterbi decoder output 
with the PCI is much less than the Viterbi decoder output without the PCI. It is my belief 
that in principle, Document [1] agrees with this assessment. 

3) Therefore, it is concluded "that there should be an even larger difference after RS 
decoding (see Figure 1)". Figure 1 of Document [1] shows that the error probability at 
the RS decoder output is much worse without the PCI. This Figure is the source of the 
conflict between the OMV test results and CLASS results. 
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There are several important differences between the analyses which make comparison 
impossible. These are outlined here. 

1) CLASS does not incorporate synchronization into the decoder performance analysis. 

This is obviously a critical issue which must be considered. Long burst lengths will occur 
at the Viterbi decoder output when PCI synchronization (and less so, Viterbi node 
synchronization) is lost. 

2) The OMV test results only consider bursts of long length but don’t consider the Poisson 
occurrence time of bursts in the real channel. Previous studies have shown that the 
S-band downlink is characterized by noise bursts which occur with Poisson statistics [3]. 
This is important because the duty cycle (taken to be 0.018 by Ted Kaplan in [1]) will 
result in more than one burst per interleaved Reed-Solomon code block. A duty cycle of 
0.018 with bursts of length 30 will cause an average error free guardband between bursts 
of 30/0.018=1667 binary symbols. Therefore, one RS interleaved block which contains 
10,200 binary symbols will result in approximately 10,200/1667=6 noise bursts. Each 
noise burst causes roughly 30 binary symbol errors, equivalent to roughly 30/8=4 RS 
code symbol errors. Therefore, 24 RS code symbols (24*8=192 binary symbols) will be 
in error on average due to the RFI. At first, it appears that these will be corrected with no 
trouble, however, because the occurrence times are Poisson for the RFI pulses, it is 
possible for some RS interleaved blocks to contain many more code symbol errors. It is 
unclear whether performance will be sufficient, in any case, the Poisson occurrences of 
the RFI bursts cannot be ignored. It is my belief that the RS decoder will have no trouble 
correcting the bursts which typically occur within one RS interleaved block. Note that 
the RS decoder does not allow error propagation due to the block nature of the decoder. 

The TDRS East environment is another matter, however. This environment is 
characterized by an interferer with a duty cycle of 1 1% or so. It is unclear whether the 
system, with or without the PCI, can handle this interferer. 

The Communication Link and Error ANalysis (CLEAN) simulator developed by me at 
MSU can help resolve the problem. Poisson occurring bursts can be generated to simulate the 
RFI in the real link and a soft Viterbi decoding program, which emulates node synchronization 
exactly like the LV7017C hardware, can be applied. This work is currently in progress along 
with the RICE compression work. 

Preliminary results suggest that the PCI is not necessary for the TDRSS West environment. 



Appendix 


The RICE Compression Algorithm: Theory and CLEAN Implementation 
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1.0 ABSTRACT 

In communication systems such as satellite data links, it is necessary to keep the 
bandwidth small due to limited channel and! or transmitter complexity. One way to alleviate the 
problem is to use digital data compression algorithms which reduce the number of bits 
required to represent a given amount of information. The RICE compression algorithm is 
frequently used in data links transmitting digital images from satellites to earth [1-5]. 

This paper summarizes RICE compression theory and simulation for a noiseless 
environment. The RICE simulation presented is an application specific to the Voyager II 
spacecraft, and is integrated into CLEAN, an existing software package. In conclusion, 
questions are presented for research relating to noisy simulations. 

2.0 INTRODUCTION 

The goal of all data compression schemes is to take source data and perform a reversible 
mapping which averages fewer output bits per symbol than the source. In general, the source 
data is first divided into words (symbols) of equal length and ordered in terms of decreasing 
symbol probability. Then, the most probable words are assigned codewords which are short 
relative to the corresponding source symbols. Similarly, the least probable words are assigned 
codewords which are long in length relative to the source symbols. Ideally, the average codeword 
length will approach the source entropy (entropy is the minimum number of bits/symbol required 
to represent the source by using any code). 

Many compression schemes, such as the Shannon-Fanno code, perform this mapping by 
table look-up. An example of a Shannon-Fanno code [6] is shown in Table 1. The source 
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symbols in Table 1 are 3 bits long and the average codeword length is 

L - £ LlxJPt; 
i - i 

or 2.75 bits. 


Source Symbols 

Probability 

Codeword 

Codeword Length 

Xo 

.2500 

00 

2 

x, 

.2500 

01 

2 

x 2 

.1250 

100 

3 

x 3 

.1250 

101 

3 

X 4 

.0625 

1100 

4 

x 5 

.0625 

1101 

4 

X* 

.0625 

1110 

4 

X 7 

.0625 

HU 

4 


Table 1. F.xamnle Shannon-Fanno Code. 

The constructs of the Shannon-Fanno code are not important The point being made here is that 
the Shannon-Fanno code mapping, as well as many other code mappings, is based on a priori 
table look-up. In reality, the source symbol statistics vary, so the symbol probability ordering 
in Table 1 can change and data expansion can occur. Therefore "table look-up" codes fall short 
when the "least probable" symbols occur too often. Thus these types of compression algorithms 
only work for a certain entropy range. Figure 1 illustrates performance for a typical "table look- 
up" code with different source entropies. Note this particular code performs best for source 


entropies from 2.5-4.5 bits/symbol. 
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Figure 1. Average Performance for a Typical Shannon-Fanno Code. 

The RICE compression algorithm is an adaptive code that employs ideas similar to the 
Shannon-Fanno code. RICE contains several different compression routines that each perform 
well under a different entropy range [4,5]. Basically, the RICE algorithm reads a block of source 
symbols, determines which compression routine is best suited for this block of data, encodes the 
symbols, and transmits the symbols along with a few ED bits which identify the compression 
routine used on this particular block. Therefore, the RICE compression algorithm can make 
adjustments for varying source symbol statistics. 

This paper discusses the general RICE compression theory, a RICE application, and a 
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computer simulation. Also, questions dealing with RICE decoding in a noisy environment are 
presented. 

3.0 THE RICE COMPRESSION ALGORITHM 

Let the sequence of any symbols, x t , x 2 , ...» x q .j be denoted as X={ Xj } . Then the entropy, 

H(X), is defined as 

H{X) - -J^Pilog 2 p t bits/ sample (2) 

where p, is the probability that occurs. The entropy of a data source is the theoretical limit for 
how many (actually, how few) bits/symbol are required to represent it. Practically all data 
sources have rime-varying entropies. The biggest advantage of RICE compression is that it can 
employ many types of compression algorithms, which collectively perform well over a wide 
range of entropies. The average performance plot for each RICE compression option looks like 
Figure 1, except each option is good over a different entropy range. The term, "RICE 
compression", does not imply the number or type of algorithms within it. This paper will only 
cover a few of them. 

3.1 PREPROCESSING 

No matter which code option is used, RICE’s first task is to order the symbol probabilities 
for each block. This is accomplished by reversible preprocessing which usually removes 
correlation from the symbols and orders them using a priori knowledge. From now on, it is 
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assumed that the following condition is true for each block of samples: 

Po * Pi ^ Pz ’•* * P a - 1 (3) 

where q is the number of symbols output from the source. Reversible preprocessing is 
summarized in Figure 2. The actual preprocessing method used will depend on the application. 



Once the condition in (3) is true or well approximated, RICE can choose which 
compression option is best for the current block. Let the compression options be denoted as % 
where i is an identifier. The ^identifiers used in this paper are identical to those in [5]. 

One code option is an obvious, trivial case. If the source symbols happen to be completely 
random, there is no need to encode them. An attempt to code them would most likely result in 
data expansion. Therefore, the simplest compression option is 

ip 3 [X] =X (4) 

3.2 FUNDAMENTAL SEQUENCE 

The simplest, non-trivial compression option is the fundamental sequence. The 
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fundamental sequence codeword operator is defined by 

fs [i] = 000. . . 0001 < 5 > 

where i is the magnitude of an input symbol and the output is i zeros followed by a "1". 
Obviously, the length of a fundamental sequence codeword is 

1 ± = L(fs[i]) = i+1 bits (6) 

Encoding J symbols as fundamental sequence codewords is denoted by 

tjj t [X] = FS [X] = fs [x t ] *fs[x 2 ] *-*fs[Xj] (7) 

where * implies concatenation and V F 1 is called the fundamental sequence of X. The length of 
a fundamental sequence is 

j J 

F - L^StX]) - £ L(fs [x^] ) - (8) 

Pi 

No matter how many bits each symbol contains, ^ would be powerful if lower 
magnitude symbols occurred most. This would be the case for highly-correlated data because 
the symbols output from the preprocessor (de-correlator) would be low in magnitude. Image data 
is a good example of this situation, since pixels on the same scan line are highly correlated [5]. 
The performance plot for the fundamental sequence is contained in Figure 4. Note that FS[X] 
performs well over H(X) of 1.5 to 3.0 bits/sample. 

3.3 % AND CFS[X] AND CFS[X] 

Let Y be a J-symbol sequence. Given a positive integer e, define the extended sequence 
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of Y to be Y concatenated with enough zeros to form a sequence whose length is a multiple of 
e. The extended sequence of Y is written as 

(9) 

y 1 = Extern = * (ys+iye+z-yzJ *“•* (yj-iyjM'" 0 ) 

There arc [7/el groups of e symbols in Ext*[Y], so there are e[7/el symbols total in Ext e [Y] 
where [J/e] is the smallest integer greater than or equal to J/e. 

As an example, let Y by the 29 bit sequence 

Y = 11010011010100111011010010111 < 10 > 

Then the 3rd extension of Y is given as 

Y f = Ext 3 [ Y] = ( 110 ) *(100) *(110) *(101) *(001) * (ll) 

( 110 ) *( 110 ) *( 100 ) *( 101 ) *( 110 ) 

where one dummy zero was added to complete the [19/3] = lO** 1 symbol of Y\ 

Compression options ¥ 2 and % attempt to remove any redundancy that may remain in 
the fundamental sequence. Clearly, from (5), it may be likely zeros in the fundamental sequence 
are more likely than ones. Define the second compression option as 

ip 2 [X] = CFSiX] = cfs [xj *cfs[x 2 ] *— < 12 ) 

where cfs means code the 3 rd extension of X mapped according to Table 2 [5]. The performance 
plot for CFS[X] is contained in Figure 4. Note that CFS[X] performs best for H(X) of 3 to 4.5 

bits/sample. 
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Input 3-tuple a 

Output Codeword: cfs[a] 

000 

0 

001 

100 

010 

101 

100 

110 

011 

11100 

101 

11101 

110 

11110 

111 

mil 


Table 2. ft-Word Code, cfstol. 

It is clear from Table 2 that when zeros are most likely in FS[X], compression will occur. It is 
possible that ones are more likely in FS[X]. Therefore, define the next compression option to 
be 

tp 0 [X] = CPSiX] * els [Xj] *c£s [x 2 ] *••• < 13 > 

where cTT comes from Table 2 with the left column complemented. The performance plot for 
CF3[X] is con tain ed in Figure 4. Note that CF3>[X] performs best for H(X) of 0 to 1.5 
bits/sample. 


3.4 THE BASIC COMPRESSOR 

All of the RICE compression options mentioned thus far collectively perform close to 
source entropies which range from 0 to 4.5 bits/sample. A block diagram for the four basic code 
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options is shown in Figure 3. Together, these code operators comprise the "basic compressor" 


* 3 c 3 



Figure 3. Four Basic Compressor Options. 
which is denoted as 

i*l 4 [X] = BC[X] * ID*ip W [X] (14) 

where ID is a concatenated 2-bit binary number representing 0, 1, 2, or 3, the compression option 
used for this sequence. Clearly, ID will be chosen such that 

2L{l*l W [X]) = min {L( l|l j [X ] )} (15) 


Rice suggests the ID decision rules outlined in Table [5]. The length of the basic compressor 





would be 


UBC[X ] ) _ 2 + L$>_idW) bits/ sample t 16 ) 

j J J 


where J is the number of samples. The rightmost term in (16) is assumed to be the shortest of 
the code options. 


Operator Decision 

Condition for FS[X] Length 

%[] 

F < 3 LJ/2J 

'Pit] 

3 LJ/2J < F < 3J 


3J < F < 3(m-2J) 

'Pat] 

F £ 3(m-2J) 

1 — 

Table 3 Basic Comnressor Decision Rules: F=FS length, J=no. samples, m=raw sequence length. 


The overhead associated with the basic compressor is 2/T bits/sample, the length of the 
ID bits. It appears that the overhead could be minimized by keeping J large. However, a large 
block size would give the basic compressor fewer chances to choose the best code option and 
the rightmost term in (16) may not be optimum. Studies by Spencer and May [7] suggest that 
the best block size is 16 to 25 samples. 

The performance plot for the basic compressor is shown in Figure 4. The trivial option, 
4*3, has been left out of the plot. This option would be a horizontal line at q, the number of 
bits/sample output from the source. 
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Source Entropy (blta/aample) 


Figure 4: Basic Compressor Performance. 


3.5 ¥ s : BLOCK-BY-BLOCK BASIC COMPRESSOR 

Let Y be an N sample sequence of samples partitioned into tj smaller blocks such that 

Y = Y 1 *Y 2 *’”*Y n < 17 > 

and each Y { is composed of J ( samples. Therefore 



( 18 ) 


The block-by- block Basic Compressor is the adaptive version of (14). That is, the block- 
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by-block Basic Compressor can change ID’s in the middle of a sequence. Define the block-by- 
block compressor as 

ip s m =v 4 [y x ] 4 W 4 ] (19) 

3.6 'F u , *F U : SPLIT-SAMPLE ENCODING 

There are many other compression algorithms that could be incorporated into RICE 
compression. The only other algorithm covered in this paper is split-sample encoding. Split- 
sample encoding recognizes when (and how many) least significant bits (LSB’s) in a source 
sample are random. When this is the case, these LSB’s are output "as is" and the remaining 
most significant bits (MSB’s) are compressed. When more LSB’s are random, the source entropy 
is higher, and split-sample encoding performs better. Therefore, split-sample encoding wotks well 
for high entropies. 

Let Mo" be a sequence of N preprocessed samples of n bits/sample such that (3) is 
satisfied. Define the split-sample operator (not the encoder) as 

SSS' k [M 0 n ] = (20) 

where 1^° is the N sample sequence consisting of k LSB’s of each sample of Mo", and Mo M is 
the N sample sequence consisting of the n-k MSB’s of each sample of Mo". The other subscript 
and superscript parameters will not be used, but are retained to stay consistent with [5]. A typical 
sample of this structure is illustrated in Figure 5. 
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n bits 



Figure 5. Typical Split-Sample Symbol. 

Define the split- sample encoder as 

Vi.jcCWo] = I**tp * [M?‘ k ] < 21) 

where i is the compression option used to encode the MSB’s, n is the number of bits/sample in 
the original sequence, k is the number of LSB’s, and n-k is the number of MSB’s. 

The decision criterion for i and k depends on the options available to the RICE 
compressor. Decision criterion for several options is given in [4,5], In this paper, only the 
decision rules for i=l will be examined. 

Since i=l, the only decision to make is k. Obviously, k will be chosen such that 


Lki 1Jt [Mj] } = min l{n> 1Jt [«?] } 

k 


Clearly, 


I.{ip ljc [<] } - £{!■?} - iff 1 K'*] } 

= WJc + L{||I ! MS" 1 '] } 


Let the sequence. Mo", be represented in terms of its samples 


Mq = m 1 *m 2 *—*m } 


( 22 ) 


(23) 


( 24 ) 


and let each sample be represented in terms of binary digits as 




where bj”' 1 is the MSB and bj° is the LSB. Substituting into (8), the length of the fundamental 
sequence of MSB’s is 


L{qi i [M $~ k ] } 



(26) 


Notice in (26) that the exponent on the two reflects the truncation of the k LSB’s. Rice [5] shows 
that when (26) is modified and substituted into (23), the length of the split-sample sequence is 


L{ IJJ 1(JC [W£]} * 2- Jc F 0 +^(l-2- k ) +Nk (27) 

where F 0 is (26) with k=0. Therefore, the RICE compressor must choose k such that (27) is 
minimized. 

Finally, define as 

ip xl [Mo] = i. k [MS] (28) 

where k’ is the binary representation of k. Note the similarities between (27) and (14). 


4.0 RICE SIMULATION 

Any compression option could be used for i in (21), including the Basic Compressor. 
Rice [3] has shown that i=l only provides good compression for the Voyager image entropy 
range. Therefore, this simulation only incorporates v F Ut . 

CLEAN, a communications simulation package developed by Mississippi State University, 
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is capable of incorporating RICE compression into many communication system configurations. 
The RICE portion of CLEAN has been adapted from existing code developed by the Jet 
Propulsion Laboratory (JPL). 

The image file format input to the RICE simulator is described in Figure 6. Figure 6a 



shows that the first record in the file contains image header data. The header data is defined as 
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follows: 

chi: "UO" if image is uncompressed, "Cl" if image is compressed 

nli: number of lines in the image file 

nbl: number of bytes per line 

nbp: number of bits per pixel 

nlr: number of label record 

nty(3): type of image file - set to 0 0 0 

ch2: user text - image title 

Figure 6b describes the image portion of the image file. One record is equivalent to one scan 
line. Therefore, the image file format is similar to the manner in which pixels are laid over a 
monitor. 

The RICE simulator processes one record at a time. Each record is broken into 16-pixel 
blocks. If a record is does not contain a multiple of 16 pixels, the last block is zero-filled. 
Therefore, for each scan line input to the RICE simulator, one reference pixel is output followed 
by 16 concatenations of (28) where n is the number of bits/pixel. In other words, the split- 
sample encoder has 16 opportunities per scan line to adjust to changing data statistics. A block 
diagram of the RICE simulator is shown in Figure 7. 

Pixels on the same scan line of image data are highly correlated. For example, adjacent 
pixels are usually about the same color and intensity. 

The purpose of the reversible preprocessing in Figure 7 is to alter the source symbols 
(pixels) such that (3) is well approximated. The probability ordering in (3) is achieved by using 
a priori information. In the case of a pixel, this a priori information is the previous pixel, or 
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Inage Noiseless Encoder 



Figure 7. Block Diagram of the RICE Simulator. 

reference pixel. The predictive decorrelator in Figure 7 subtracts the previous pixel from the 
current pixel, yielding a difference value, A. Since adjacent pixels arc approximately equal, the 
most likely values for | A| are close to zero. Therefore, the mapping in Figure 6 outputs integers 
(8’s) whose probability ordering matches the condition in (3). This mapping is outlined in 
Table 3. 

The 8 values arc well conditioned for split-sample encoding because they are 
mosdy low in magnitude. Therefore, their MSB’s will contain significant redundancy and their 
LSB’s will be somewhat random. 
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A Condition 

8 Assignment 

0 < A £ previous pixel 

2A 

A > previous pixel 

pixel value 

(previous pixel - maximum) ^ A < 0 

2|A|-1 

(previous pixel - maximum) > A > 0 

maximum - pixel value 


Table 3. A— >5 Mapping Rules. 

Note that first pixel from each scan line, the reference pixel, is sent uncoded. At the 
decompressor, the reference pixel is used in conjunction with the 8 values to reconstruct the scan 
line. 

The compressed image file output from the RICE simulator is described in Figure 8. The 
header for the compressed file is the same as Figure 6a. Clearly, each record of compressed data 
will be variable in length. Therefore, the number of bytes for each compressed scan line is stored 
at the beginning of each record. The reference pixel will be used with the decoded series of 8 


nbyte_pack reference ID 16 SS-code pixels 


ID 


16 SS-code 


pixels 


nil 

Lines 


nbyte_pack 


ref erence 


ID 


16 SS-code pixels 


ID 


16 SS-code pixels 


Figure 8. Compressed Image File Format. 
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values to reconstruct the original scan line. 

The ID bits tell the decompressor how many LSB’s (k) where split from the original 
pixel. Note that the ID bits say nothing about the length of the FS encoded MSB’s. An example 
of a split-sample encoded pixel is shown in Figure 9. 


0 

0 

0 

0 

1 

1 

0 


1 

1 

0 


FSCMSB's] LSB's 


Figure 9. Typical Split-Sample Encoded Pixel. 

As the RICE decompressor simulator reads the compressed image file from left to right, 
it must have some way of knowing where the encoded MSB’s begin and end and where the 
LSB’s begin and end. After the ID bits are read, the simulator will begin reading the FS encoded 
MSB’s. As soon as the simulator reads a "1", it assumes that this is the end of the FS encoded 
MSB’s and the next k bits are the LSB’s for the current 8. This 8 decompression is repeated 15 
more times (remember, the 8’s were encoded in 16-integer blocks), then the next ID is read and 
the process repeats until nbyte_pack bytes have been read in. Once all the 8 values have been 
decompressed, they will be used with the reference pixel to reconstruct the original scan line. 
The decompressor repeats all of this until nli lines have been processed. 

4.1 RICE/CLEAN INTEGRATION 

Programs called img2seq and seq2img provide the interface between JPL’s RICE 
compression code and MSU’s CLEAN code. Img2seq converts the RICE image file format to 
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the CLEA N sequence file format. Conversely, seq2img converts the CLEAN sequence file 
format to the RICE image file format. Both programs work for compressed or uncompressed 
formats. The image header data from record 1 of the image file is stored on records 60-100 in 
the sequence file. The image data starts on record 101 of the sequence file. 

The RICE/CLEAN integration exists to study the effects of a noisy channel on RICE 
decompression. Clearly, from Figure 9, errors in the LSB’s will only result in pixel distortion. 
However, errors in the ED bits or FS encoded MSB’s will may the decompressor to overlap the 
FS encoded MSB’s with the LSB’s. Consequently, synchronization of the FS encoded MSB’s, 
LSB’s, and ID bits would be lost Block loss, or even line loss could occur. In other words, 
errors in the appropriate positions would cause "error propagation" in the decoded pixels. 

5.0 CONCLUSION 

RICE compression performs quite well over a broad entropy range. However, the effects 
of noise on the decompressor output are still relatively unknown. The following questions about 
RICE need to be answered. Do errors output from inner error-correction codes cause catastrophic 
errors output from the RICE decompressor? If so, and extra error-correction encoding is needed, 
what is the net coding gain Will error propagation occur? If so, how is it stopped in real systems? 
Does error propagation really matter? What error statistics are important: pixel distortion, block 
loss, line loss, etc.? 
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A Directed Search Approach for Unit-Memory 

Convolutional Codes 1 2 

William J. Ebel, Member, IEEE 1 

Abstract - A set of heuristic algorithms to numerically search for good, binary Unit-Memory 
Convolutional Codes (UMC) are presented along with a large number of new codes for 2 < k < 8 
and code rate 1/4 < R < 1. Combinatorial optimization is used which involves selecting and then 
pairwise matching column vectors of the two (n,k) UMC tap weight matrices. The column 
selection problem is that of finding the best (2n,k) binary, linear Block Code (BC). In this 
paper, the best BC generator matrix G is found by successively refining G using directed local 
exhaustive searches. In particular, the set of minimum weight codewords are used to find a 
subset of G to exhaustively search. The UMC search strategy (pairwise matching problem) uses 
a directed local exhaustive search similar to the BC directed search by using the concept of the 
terminated BC of the UMC. The heuristic algorithms developed in this paper are very robust and 
converge relatively quickly to the optimal or near optimal UMC. In addition, although it is 
generally possible to achieve the block code upper bound for free distance, we give a class of 
UMC’s which cannot achieve this bound. 

Index Terms - Binary block code, binary unit-memory convolutional code, extended row 
distance, combinatorial optimization. 

I. INTRODUCTION 

Many conventional communication systems employ rate 1 In convolutional codes. These 
codes have a high performance/complexity ratio which make them well suited for practical 
applications. These codes can be implemented with Viterbi decoders that have only 2 S paths 
traversing a single trellis stage, where S is the number of decoder states. As an alternative, we 
consider the class of Unit-Memory Convolutional codes (UMC). A UMC has a fully connected 

1 This work was supported in pan by the NASA Goddard Space Flight Center under Contract NAG5-2006. 

2 The author is with the Department of Electrical and Computer Engineering, Mississippi State University, Box 
9571, Mississippi State, MS 39762. 



decoder trellis which requires a higher implementation complexity than a rate \ln code for the 
same number of decoder states. However, the best UMC generally has better distance properties. 
In this paper, we are concerned with finding the best ( n,k ) UMC’s for 2 ^ k < 8 and code rate 1/4 
<R< I. 

The Combinatorial Optimization (CO) search technique introduced by Said and Palazzo [1] 
involves selecting column vectors (columns selection problem) and then pairwise matching the 
column vectors (columns matching problem) to form the two tap weight matrices of an (n,k) 
binary Unit-Memory Code (UMC). The columns selection problem is essentially that of finding 
the best (2 n,k) binary, linear Block Code (BC). Said and Palazzo approached this using column 
vector substitutions and an objective function which adds a penalty for every codeword with 
Hamming weight less than the target d min . The columns matching problem requires matching the 
columns of the best BC to achieve a UMC with good distance properties. They also approached 
this using an objective function based on the extended row distance and randomly swapped 
columns until the objective function was optimized. In this paper, the CO technique is also used, 
however the algorithms have been designed to direct the search to improve efficiency and result 
in more optimal codes. 

Other UMC’s have been found using different memory structures and different optimizing 
criteria. Justesen et. al. [6] found a set of UMC’s with tap weight matrices composed of circulant 
submatrices using the free distance and the extended row distance as the optimizing criteria. 
Mooser [8] found many Periodically time-varying convolutional codes (PTVC), Lauer [4] and 
others [7] have found good Partial Unit-Memory Codes (PUMC). Multi-Memory Codes (MMC) 
have also been extensively studied [9,10,12]. It can be shown that the UMC is a superset of the 
PUMC, PTVC, and MMC [3,8]. 

In Section II to follow, the directed BC search algorithm is described, and in Section III, the 
directed UMC search algorithm is described. New Unit-Memory Convolutional codes are 
presented in Section IV and the conclusions follow in Section V. 



II. DIRECTED BLOCK CODE SEARCH 
A (2 n,k) block code (BC) is defined by the relation 

y -xG 

where G is the k x In generator matrix. The algorithm to be presented begins with an initial 

matrix G and then successively refines G until the best BC is found. Since the algorithm does 
not always operate on the best found generator matrix, let G denote the generator matrix being 
refined and let G* denote the best found generator matrix. The set of codewords for generator 
matrix G is denoted C, the Hamming weight distribution is denoted W ( IT, represents the number 
of codewords with weight /), and the minimum distance is d min which is subsequently denoted 8. 
Similarly, C\ W\ and 8* are associated with the generator matrix G\ 

We consider a BC optimal if 8 is a maximum, the number of non-zero minimum weight 
codewords W h is a minimum, and if the weight distribution is optimal. By optimal weight 
distribution, we mean one which has maximum partial second order moment defined by 

5 + w-l 

M w = I (i - 8) 2 W; 

i a 5 + I 

where w is the weight window width. Following the convention established above, M w denotes 

the partial second order moment of W . Since M w is not a function of W s 1 through fV 2 „, this 
definition leaves open the possibility for multiple "optimal" weight distributions. However, the 
best found (2n,k) BC will subsequently be used to construct an (n,k) UMC. Since it is unclear 
what effect the weight distribution from W& + w + 1 through W ln may have on the optimality of the 
best UMC, all BC’s with the same 8, Wg, and A/* are considered equally optimal. 

The motivation for requiring maximum 8 and minimum JF S is due to the fact that the best 
(n,k) UMC constructed from a (2n,k) BC will generally have a free distance of 8 and a number 
of free distance paths equal to Wg. This is discussed in Section III below. The motivation for 
defining the partial second order moment is twofold. First, we have observed that many 



different weight distributions for a (2n,k) BC have identical full second order moment M ln _g» 
and therefore the full moment does not provide sufficient "sensitivity" during the refinement 
process. Second, by maximizing M w , the refinement process is biased to select a weight 
distribution with the bulk of the weight grouped near 5 + w. Many weight distributions exhibit 
peaks for weights slightly greater than 5. For example, the best (12,5) BC has a weight 
distribution fV= [1, 0, 0, 0, 1, 8, 12, 8, 1, 0, 0, 0, 1]. In this case, choosing w = 3 biases the 
refinement process to settle on a code with this distribution. 

The main algorithm for successively refining G alternates between two algorithms called 
the Row Iteration Algorithm denoted A r and the Column Iteration Algorithm denoted A c . We use 
A r (G) to mean that algorithm A c is performed with G as the initial BC and we denote the set of 
parameters to be optimized by 0(G) = {5, JV&,M W }. Furthermore, 0(G )>0(G ) refers to the 
notion that G* is better than G. That is, we say that G* is better than G if the first criterion, in 
order of priority, which is different between G* and G is bettered by G . With these definitions, 

the main algorithm is given by: 

1) Randomly choose G. G <— G. 

2 ) g <- A r (G). If 0(G) > 0(G*), then G* <-G. 

3 ) G <^A C (G,M' W ). If O(G) > 0(G‘), then G* <- G. 

4) If not done, go to Step 2. 

At Step 4, the algorithm terminates if no better BC has been found after N A cycles through the 

main loop (Steps 2 and 3). 

A. Row Iteration Algorithm 

The row iteration algorithm requires that a set B of binary elements of G be selected and 
then exhaustively modified in order to refine G. As described above, a refinement takes place 
when 0(G) > G(G*), where G* represents the best found generator matrix in this algorithm. The 
set B is chosen by a voting method based upon those codewords which have weight near 8. 



Let X be the set of input vectors defined by 

X = {* : 5 < w(jcG) < 5 + w - 1} 

where w(iG) denotes the Hamming weight of codeword xG. There are 

5 + w - 1 

I W t 

i =6 

total vectors in X. Consider the input vector xzX which gives rise to the codeword xG. The 

element g i} of G receives a vote if both, x has a 1 in the t h component andxG has a zero in the/* 
component. This vote simply means that complementing g i} will change the weight of codeword 
x g. After all the votes have been cast for the vectors in X, the set B is chosen to be the N B 
elements of G which received the most votes. Modifying this subset of G is more likely to alter 
the weight distribution (W s through W^ w . x ) than an arbitrary set. We conjecture that an optimal 
BC is one with uniform (or nearly so) votes for the elements of G, and any transformation of G, 

inB. 

For example, we searched for the best (12,4) BC using w = 2 and N g = 8. The maximum 

achievable minim um distance is 6. During the search, a suboptimal code with d min = 5 and 

weight distribution W= [1, 0, 0, 0, 0, 5, 5, 2, 1, 1, 1, 0, 0] was found. The generator matrix G for 

this code had the following vote count for each corresponding element: 

'4 3224332333 3' 

221221112223 
221023312221 
21112110202 1 

At the completion of the algorithm, the optimal code with d min = 6 and weight distribution fV = 
[1, 0, 0, 0, 0, 0, 12, 0, 3, 0, 0, 0, 0] was achieved. The generator matrix vote count was found to 
be: 



'4 3033333434 3' 
343334033433 
444444444444 
44444444444 4 

This is more uniform than the suboptimal one. We note that the elements with zero vote counts 
was a common occurrence in our search experience. Most codes had between one and four zero 
vote counts, even when the rest of the vote counts were somewhat uniform. 

As a final note, before this algorithm was to be performed, a number of random 
transformations were applied to the input generator matrix G. A transformation preserves the 
weight distribution but generally gives rise to a different maximum vote count for the set of 
elements of G. The transformed generator matrix with the largest maximum vote count was 
chosen for the refinement procedure. 

B. Column Iteration Algorithm 

The column iteration algorithm involves exhaustively modifying each column, in order of 
priority, of G in order to refine G. The column priority is assigned by accumulating the vote 
counts of G for each column. If the dimension of the code is k, then there are 2 1 possible 

ways to modify a given column. Once a column has been exhaustively modified, the next 
column in order of priority (ties resolved randomly) is exhaustively modified. When all the 
columns have been modified, the algorithm starts over with the highest priority column again 
until N c columns have been modified. As above, a refinement takes place when 0(G) > 0(G *), 
where G* represents the best found generator matrix in this algorithm. 

Since the solution space is not convex [1], it is possible for a local minimum to be reached. 
To allow the algorithm to "back out" of local minima, an additional criterion involving the partial 
moment is added. That is, we consider 0(G) > O(G') if the partial moment M w is within a factor 
Q<K m <\ of the best BC partial moment M*. 



III. DIRECTED UNIT-MEMORY CODE SEARCH 
An ( n,k ) Unit-Memory Convolutional Code (UMC) is described by the k xn binary tap 
weight matrices F 0 and F,. These matrices relate the encoder input vector and output vector 
(called a code symbol) by 

y i =x i F 0 + x i _ l F } 

where x, is the /'*, ^-dimensional binary input vector, y, is the z'\ n-dimensional binary output 

vector, and where all operations are in GF(2). Let the ^-dimensional vector 5, denote the state of 
the encoder for the i‘ h input, where p is called the state complexity [3,4]. If F , is not full rank, 
then p<* and the UMC is called a Partial Unit-Memory Code (PUMC) [4], We will assume 
that F, is full rank so that p = k and s ( , = x, . , . Furthermore, we assume that x, = 0 for i < 0 and 
s 0 = 0. 

The tap weight matrices also define a BC. Observe that the input sequence x, = x,,, 0, ••• 

gives the output sequence y, = x 0 F 0 , x 0 F, , 0, • • • and can be written 

[To Ti] = i] = *o If. ^r] 

This also defines a BC and provides a relationship between the (2 n,k) BC G = [F 0 F,] and the 

(n,k) UMC with tap weight matrices F 0 and F,. For the BC G, let 6 be the minimum distance, 
Wj be the weight distribution, and let d fr „ be the free distance of the UMC. Since the paths given 
by the codewords of G are valid paths on the UMC decoder trellis, we must have d fr „ < 8. This 
is called the block code upper bound on free distance [3]. Brouwer and Verhoeff [5] have 
tabulated the tightest known bounds on 8 for every binary BC with n and k ranging from 1 to 
127. 

There are a set of UMC’s which cannot achieve the block code upper bound. First, it is 
easy to show that any (2n,k) BC which contains the all ones codeword must give rise to a 
catastrophic (n,k) UMC. Now consider the first order (2 k '\k) Reed-Muller code with generator 


matrix 






where G'*_, is a (k- 1) x (2* “ ’) matrix consisting of all possible 2* 1 column vectors. The 


weight distribution is 


W ,= 


1 , 

2 * - 2 , 

0 , 


j = 0J=n 
j =n/2 
else 


0) 


which achieves the d min upper bound. However, no other (2* \£) BC will achieve the d mim upper 


bound. Modifying any of the non-zero columns of G'*_, must necessarily result in a decrease in 
d min . Similarly, modifying the zero column of G'*_, or the first row must also necessarily result 
in a decrease in d mim . 

Theorem 1: Any (2*~\&) UMC cannot achieve the block code upper bound. 


Proof: Since every (2*"',Jfc) BC with maximum <4*. has a weight distribution which 

includes the all ones codeword, the resulting UMC must be catastrophic. 

Thommesen and Justesen have shown [2] that it is possible to extend the connection 

between a UMC and a BC. The BC with generator matrix 

'F 0 F, 0 0- 

0 F 0 F, - 0 

Gj — 

J | t « « • « » • 4 ft • • • * • 

0 F 0 F, 

is called the;'* terminated BC of the UMC, where j denotes the number of row block matrices, 

and the minimum distance is called the j ,h row distance dj. To minimize the decoded 
error-probability, it is necessary to maximize the row distances d] for j > 1. Indeed, since the 
maximum value for d\ is 8 (i.e. G\ = [F 0 F,]) and since d] must be a non-increasing function of 
;, the maximum possible row distances are 

</;= 5, y>i 



which occur when the block code upper bound is achieved. 

Consider a subset of the codewords generated by GJ. If I } is the set of input sequences 

where x, * 0 for i < j and x f = 0 for i>j, then the extended row distance is defined by 

d r , = min (w(xG r )} . 

J xelj 

This represents the minimum weight of those codewords which start at the all zero state and 
return for the first time at the (j + l)'* trellis stage. Therefore, the BC defined by G- on the set of 
input sequences Ij has minimum distance d r j. The free distance is given by 

dfree = (fy . 

Clearly, d] > d r } which gives d free = d r j> 5 for j > 1 when the block code upper bound is achieved. 


Since d] is the minimum distance for the BC defined by G- on the set of input sequences /,, 


the columns matching problem can be approached in a manner similar to the columns selection 
problem which was described in the previous section. In this paper, we optimize the second 
terminated BC defined by 



Ft 

F q 


O' 

F t 


on the set of input sequences I 2 and with minimum distance d\. The directed local exhaustive 


search is in the form of column permutations and the chosen optimizing criteria is a superset of 
those used in the row and column iteration algorithms. Column permutations have no effect on 
the weight distribution of G'. 

The algorithm presented below begins with an initial matrix G = [F 0 ^i] and then 

successively refines G with column permutations until the best UMC is found. The subsequent 

development follows closely with that given in Section II above. Let G denote the UMC being 

• 

refined and let G' denote the best found UMC. They'* terminated code is given by G] and (GJ) 



for G and G\ respectively. The set of codewords for generator matrix Q' on the set of input 

vectors /, is denoted C } , the weight distribution is denoted W } (fV j t represents the /'* weight value 

for the j th terminated BC), and the minimum distance is d- which is subsequently denoted 5 y. 

* 

Similarly, C-, Wj, and 8* are associated with the matrix (GJ) . 

In this paper, the UMC optimizing criteria, in order of priority, are chosen to be; (1) 
maximum d,- ree , (2) minimum number of free distance paths Nf ree , (3) maximum S 2 , (4) minimum 
W 2 62 , (5) maximum 6 3 , (6) minimum W 3 S} , (7) maximum partial second order moment of W 2 
which is denoted (8) maximum S p for some p » 1, (9) and minimum where P is an 
arbitrary, preselected index. The number of free distance paths is 

h j= d frtt 

Criteria (1) through (7) optimize G[ through G 3 r . We have observed that maximizing M 2tW 

promotes the selection of a UMC with a rapidly growing distance profile. Finally, criteria (8) 
and (9) are included to promote the elimination of a catastrophic code. Again, we use 
0(G‘) > 0(G) to denote the notion that'G’ is better than G using these prioritized criteria. 

The Viterbi decoding algorithm is used to determine the extended row distances 8, which 

gives 8j, 8 3 , 8 P , and d free . As long as Jj = d free occurs fory < P, it is only necessary to generate 
for 1 <j < p. By counting the number of paths which achieve 8, for each j gives N free , W 2 ^ t 
W 3Sy , and JTp V The second order partial moment M 2%w is found by constructing the weight 
distribution W 2 from G 2 on the set of input sequences / 2 . 

The main algorithm for successively refining G = [F’o ^i] alternates between two 

algorithms called the Column Permutation Algorithm A p and the Column Swap Algorithm A , and 
is given by: 

1) Input G (from the BC search algorithm in Section II). 

2) G *-A p (G). If 0(G) > O(G’), then G“ <- G. 



3) G <r- A s (G,Mlfy). If 0(G)> 0(G’), then C7 * <— U. 

4) If not done, go to Step 2. 

At Step 4, the algorithm terminates if no better UMC has been found after some number of 
cycles through the main loop (Steps 2 and 3). The tap weight matrices for the final UMC are 
[F 0 F ] ] = G\ 

A. Column Permutation Algorithm 

The column permutation algorithm A p (G) requires that a set of columns be identified and 

then systematically permuted (local exhaustive search) to refine G . Let jc 2 5 be the set of input 
vectors which result in minimum weight codewords for G 2 . That is, 

* 2.6 = {x :xel 2 , w(xG 2 ) = 8 2 } 

We define c 2 5 to be the set of minimum weight codewords in G[ and c, for 1 < i < 3n to be the 

number of codewords in c 2 5 which contain a binary zero in the /'* column. Changes in those 
columns which correspond to large values of c, are more likely to alter the weight distribution of 
G 2 and therefore the columns to be selected should come from this set. To localize the 
permutation search, however, pairs of columns are selected so that if column i < n is chosen then 
column i + n is also chosen. Operating on D c column pairs can only change 8 2 by D c . 

To select a useful set of column pairs, c, is folded to give 

c' t = c ( + + c f + 2 n , 0<i<n . 

The function c\ gives a measure of how likely the column pair (/,/ + «) will change the weight 

distribution of G\. Therefore, the chosen set of D c column pairs correspond to the largest c', 
values with ties resolved randomly. The Column Permutation Algorithm simply involves 
determining the set of column pairs via c'„ exhaustively permuting those column pairs, and 
retaining the UMC with the best optimizing criteria Q(G). 



B. Column Swap Algorithm 

This algorithm is similar to the column permutation algorithm. However, the permutation 
method is an exhaustive procedure where each trial involves swapping two columns of G. For n 
columns, this requires n(n — 1 ) / 2 swaps for one complete permutation set. As with the column 
iteration algorithm, the solution space is not convex and therefore it is possible for a local 
minimum to be reached. To allow the algorithm to "back out of local minima, an additional 
criterion involving the partial moment is added. That is, we consider 0(G) > 0(G‘ ) if the partial 
moment M l w is within a factor 0<K M <\ of the best partial moment M 2 * w . 

IV. NEW UNIT-MEMORY CODES 

The algorithms described in the previous sections were used to search for the best UMC’s 
with 2<k<% and with code rate 1/4 <F < 1. In searching for the best (2n,k) BC, the 
algorithms described in Section II generally used w = 3, \2<N B < 20, and K M = 0.7. In 
searching for the best (n,k) UMC, the algorithms in Section III used w — 3, P — 10, and — 0.7. 

Table I provides a list of parameters for the currently best known UMC’s. The minimum 
weight and number of minimum weight codewords for the (2n,k) BC with G = [F 0 F,] is given 
along with the block code upper bound [5]. Also shown arc the free distance, number of free 
distance paths, the extended row distance <? r for 1 < r < 5, and the corresponding number of paths 
W r 6 . The corresponding tap weight matrices are given in Table II, where each tap weight matrix 
column is represented by a base 10 number as shown. 

A total of 105 UMC’s are presented in Table II, most of which are new except for the 
following. Said and Palazzo [1] previously found the (10,7), (13,7), (11,8), and (24,8) UMC’s 
shown in the table, and Palazzo [12] found the (3,2), (4,2), (6,2), and (4,3) UMC’s using a 
network flow approach. Also, Dettmar and Shavgulidze found the (7,3) UMC [7]. Concerning 
the (8,7) UMC, our algorithms could only find a d free = 4 code. The one given in the table was 
brought to our attention by O. Ytrehus [11]. Finally, although all our codes are better than the 
Quasi-Cyclic UMC’s found by Justesen et. al. [6] based upon our criteria, a few of the 
Quasi-Cyclic UMC’s have faster rising extended row distance function. 



The large number of discovered codes which achieve the block code upper bound attest to 
the robustness of the directed search algorithms. In most cases, the ( n , k ) UMC which was found 
from the best (2 n,k) BC not only achieved d free = d mm but also achieved N free equal to the number 
of BC minimum weight codewords. The most noteworthy exceptions are the high rate codes 
which either do not attain the upper bound, or have a number of free distance paths which exceed 
the number of minimum weight codewords. The codes which do not attain the upper bound are 
the (7,6), (8,7), (9,7), (9,8), and (10,8) codes. 

Concerning the (15,6) UMC, we could not find a d min = 14 (30,6) BC without the all ones 

codeword. We suspect that one does not exist. Therefore, we found the best d free = 13 (15,6) 
UMC. Also shown in the table are the modified upper bounds for the set of (2 ,k ) UMC s via 
Theorem 1 and the upper bound for the (3,2) code which can be shown to be 3 via exhaustive 
search. In some cases, the BC which achieves the maximum d min was not found. These are the 
(46,7), (50,8), (52,8), (58,8), and (64,8) BC’s. We present these UMC’s as the best which have 
been found to date but note that more optimal ones almost surely exist. 

As a final note, a 66MHz Pentium-based PC was used to search for the codes. The BC 
search required between 5 minutes and an hour to converge, and the UMC search required 
between 5 minutes to (approximately) 24 hours to converge. The exponential growth in required 
computer time inhibited the search for codes larger than those presented in this paper. 

V. SUMMARY AND CONCLUSIONS 

In this paper, algorithms have been described which use combinatorial optimization and 
directed local exhaustive searches to find the best known Unit-Memory Convolutional (UMC) 
codes. A total of 105 UMC’s are presented most of which are better than previously known 
codes. In addition, a class of UMC’s are given which cannot achieve the block code upper 


bound. 
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Table I. Comprehensive list of the best known Unit-Memory Convolutional (UMC) Codes for 
2 < k <, 8 and code rate 1/4 < R < 1 . 



















































































































































































































































































































































































































































<» Vi* exhaustive search, upper bound is reduced to 3. 
iZ) See Reference [ 1 2]. 

0) See Reference [7]. 

Block code upper bound reduced by one due to Theorem l . 


0> < d mtm of the [Ho 

<‘> Algorithms failed to find a d mtm = 14. (30.6) BC without the all ones codeword. 
<7> See Reference [ 1 ]. 

<S) Upper bound not achieved for the [H* Hi] BC [5], 


i9) See Reference [11]. 


























































































































































































































































































































































































































































































































































































Tap weight matrices for the best known Unit-Memory Convolutional (UMC) Codes for 
2 < k < 8 and code rate 1/4 <R < 1 . 
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Technical Note 


Date: March 17, 1994 

To: Warner Miller, Victor Sank, GSFC 

From: William J. Ebel, Mississippi State University 

Subject: TRMM Performance Without the PCI 

Cc: Frank Ingels 

Abstract - The developments of the Communication Link and Error ANalysis (CLEAN) 
simulator have reached the point where accurate assessments of the performance of the TDRSS 
downlink without the PCI but with node synchronization in the Viterbi decoder can be 
performed. In this note, the applicable developments of CLEAN are described and preliminary 
results of the PCI study are presented. 


I. Introduction 

As you are well aware, the TDRSS downlink includes an optional PCI and a required rate 
1/2 constraint length 7 convolutional code. The demodulator provides 8-level soft-decision data 
to the DePCI and subsequently to the Viterbi decoder. Tap synchronization of the DePCI or 
node synchronization of the Viterbi decoder must be established at the receiver depending upon 
whether the PCI is switched on. The question which has been posed and discussed [1,2] is 
whether the PCI is necessary for the TDRSS downlink as used by TRMM. 

In Section II to follow, applicable developments of CLEAN are described and their 
relevance to the actual system is discussed. In Section in, preliminary studies of the 
performance of the TDRSS downlink for TRMM without the PCI is presented and conclusions 
follow in Section IV. 



II. CLEAN Simulator Developments 

Two recent CLEAN developments are relevant to the issue at hand; 1 ) program vit3sync 
which mimics the LV7017C Viterbi decoder with node synchronization, and 2) program rfi 
which mimics the effect at the soft-decision demodulator output of multiply occurring RFI 
sources in the TDRSS downlink. Besides these, program blkdeint which performs depth 5 block 
deinterleaving and blkdecod which performs Reed-Solomon (RS) decoding for the (255,223) RS 
code are used to estimate performance. These latter two programs are described in [3]. 

The soft-decision data output by the TDRSS downlink receiver at White Sands is input to 
the LV7017C hardware for PCI/node synchronization and Viterbi decoding. The specific details 
of the synchronization strategy and Viterbi decoding can be found in [4], The algorithm to 
perform node synchronization, when the PCI is not switched on, and soft-decision Viterbi 
decoding in the LV7017C hardware has been exactly duplicated in the CLEAN program 
vit3svnc. This includes the Viterbi decoder trellis metrics, the metric renormalization strategy, 
the node synchronization strategy using a SyncCounter, the path memory length, etc. 

Both functional and statistical verifications of program vit3sync were performed. For the 
statistical verification, two critical performance factors for the LV7017C Viterbi decoder were 
considered; (1) the Viterbi decoder output Bit Error Rate (BER), and (2) the average time to 
detect loss of node synchronization. To consider item (1), Figure 3.1 shown on the next page 
was extracted from [4], This figure shows a comparison of the LinCom Viterbi decoder 
simulation results with theoretical bounds assuming an AWGN channel. The cross hairs show 
corresponding simulation results for program vit3sync of the CLEAN simulator. The BER for 
vit3sync matches the LinCom simulation exceptionally well. To consider item (2), various soft 
values were deleted from a post threshold data stream. The average time to detect loss of node 
synchronization has been shown to be roughly 200-300 bits at the Viterbi decoder output [4,5] 
for E b /N () = 5dB. For vit3sync, average time to detect loss of node synchronization was observed 
to be 200-350 bits at the Viterbi decoder output. 

Program rfi was written to mimic the effect several RFI sources have on the soft values 
output by the 8-level receiver threshold device and also includes thermal noise. It is assumed 
that the occurrence time for noise bursts are Poisson in distribution and that the burst duration 
spans 1 5 threshold output soft values. The program requires the following inputs: 

1 ) Hard-decision error probability due to thermal noise alone 

2) Number of RFI sources 

and for each RFI source, the following inputs are required: 

1 ) Hard-decision error probability during a burst (inclusive of thermal noise) 

2) Average interval between bursts (reciprocal of rate of burst occurrence) 



3) Burst length (in terms of the number of threshold output soft values) 

The philosophy for determining the soft values output by the 8-level threshold device is 
as follows. When no RFI occurs, the random variable at the threshold input is Gaussian with a 
mean and variance which are related to the signal power and noise power, respectively. The 
ratio of the signal power to the noise power is the signal-to-noise ratio. In fact, there is a 
Q-function relationship between the hard-decision error probability and the signal to noise ratio 
for the random variable at the threshold input. When an RFI noise burst occurs, it has the effect 
of increasing the noise power but does not affect the signal power, and therefore has the effect of 
increasing the hard-decision error probability. However, the probability density function of the 
random variable at the threshold input is still Gaussian. Therefore, to determine the probability 
density function of the random variable at the threshold input, it is only necessary to know the 
hard-decision error probability at the threshold output during each RFI source burst. In 
summary, to determine which soft value is output by the thresholder, it necessary to determine: 

1) which RFI sources are currently causing a burst to occur 

2) the total error probability for the current threshold output value 

3) the variance of the Gaussian random variable to be thresholded 

The method used to statistically generate the soft output value for a given hard-decision error 
probability is described in [6] for program iidsoft. 

In effect, the program can accurately represent the statistical nature of the soft values at 
the 8-level threshold output due to thermal noise and multiple RFI sources. To justify the 
appropriateness of this model, recall that all the distortions which occur in the real TDRSS 
downlink including satellite non-linearities, hardware distortion, etc. are lumped into a single 
parameter called the implementation loss which is realized in terms of an offset to the 
signal-to-noise ratio at the receiver [7]. 

HI. Preliminary PCI Study Results 

Several studies were conducted to investigate the performance of the TDRSS downlink 
for TDRS West and TDRS East and for the SSA return link. These RFI environments may be 
found in [8] and are summarized below. 

The first study conducted involved simulating the TDRS West environment as described 
by Ted Kaplan in [2] for TRMM. The TRMM data rate is 2Mbps (4MHz channel rate) and each 
RFI pulse is 3.5 usee long so that at most 15 threshold output soft values will be affected by a 
single burst. The signal-to-thermal noise ratio at the receiver was taken to be E b /N 0 = 4.5dB so 
that the Viterbi decoder output error probability is roughly 10' 5 (with no RFI). This corresponds 
to a thermal noise hard-decision post threshold error probability of about 0.0465. Of the 5 



known RFI sources which exist for the SSA return link for TDRS West, the two were ignored 
and three were treated as a single source with infinite power consistent with [2], This is 
summarized in Table I [8]. 

Given this channel model, the following simulation was conducted using CLEAN. 

1 ) Generate PN code to simulate data (1 .5x1 0 7 bits) using program pnseq 

2) Convolutionally encode the PN sequence using the NASA rate 1/2 constraint length 7 
convolutional code using program convened 

3) Generate the soft sequence which would occur at the threshold output for the encoded 
binary sequence using programs bstysoft and soften. 

4) Viterbi decode the received soft sequence (no PCI present) using vit3sync which 
includes node synchronization and mimics the LV7017C hardware decoder 

5) Determine the error sequence at the Viterbi decoder output using program madd 

6) Block deinterleave the error sequence (depth 5) using program blkdeint 

7) Reed-Solomon decode the deinterleaved sequence using program blkdecod 

The results are summarized in Table II and suggest the following. If the signal EIRP is chosen 
so that the Viterbi decoded BER (with no RFI) is 10' 5 per spec (E b /N 0 = 4.5dB) but RFI for 
TDRS West happens to be in the channel (clearly a worse case scenario), then the Reed-Solomon 
decoder can correct many of the errors and provide a total system BER in the neighborhood of 
2 x 10" 6 . It is difficult to draw tangible conclusions from these results because the TDRS West 
RFI environment has been simplified (as in [2]) and because the simulation only resulted in 1 RS 
decoding failure out of 7350 codewords which is not a very good statistical estimate. However, 
these results bring into question the results shown in Figure 1 of reference [2] which shows the 
RS decoder output BER to be about 2 x lCr 4 for an E b /N 0 = 4.5dB and no PCI. If this were the 
true RS decoder output BER, then there should have been on the order of 3000 binary errors at 
the RS decoder output. 

To more accurately represent the TDRS East and TDRS West RFI environments, 
program rfi of the CLEAN simulator was used to simulate all the RFI sources for two different 
thermal noise EIRP values as shown in Tables III, IV, V, and VI. The signal EIRP was taken to 
be 31.6dBW which gives a hard-decision threshold output BER of 0.0283 when 29dBW thermal 
noise is present with no RFI and 3.47 x KT 4 when 24dBW thermal noise is present with no RFI. 
The burst length of every RFI burst was taken to be 15 soft-decision threshold output values 
consistent with [2], 

These environments were used with the CLEAN simulation described above to 
investigate the performance of the NASA concatenated coding scheme including the rate 1/2 
constraint length 7 convolutional inner code with the (255,223) RS outer code. The PCI was not 
switched on and program vit3sync was used to perform Viterbi decoding which includes node 



synchronization. The results are shown in Tables VII, VIII, IX, and X. These results suggest 
that for the worst case scenario, the system will fail for the TDRS East environment. Although 
decoding failure did not occur for the TDRS West environment, no tangible conclusions can be 
drawm at this time due to insufficient statistics. 

IV. Conclusions 


In conclusion, the Communication Link and Error ANalysis (CLEAN) simulator is a 
useful tool for investigating the performance of the TDRSS downlink. The issue of whether the 
PCI is necessary for the TDRSS downlink has been investigated. Preliminary results suggest 
that the TDRS West RFI environment may not be a problem without the PCI, but that the worst 
case TDRS East environment may give rise to significant decoding failures out of the 
Reed-Solomon decoder. 
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Table I. Model of the TDRS West RFI environment 


EIRP Burst Duty Cycle 

(dBW) Length (%) Comments 


25 

15 

3.5 

ignored 

35 

15 

2.2 

ignored 

45 

15 

0.6 

1.8% D.C. 

55 

15 

1.1 

single source 

65 

15 

0.1 

BER = 1/2 


Table II. Performance results for the TDRS West RFI environment model 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

3x 10 7 

1.64 xlO 6 

(2.68 x 10 5 in bursts) 

0.0547 

Viterbi Decoder 
Output 

1.5 x 10 7 

1.0 x 10 s 

(no node sync loss) 

6.68 x 10~ 3 

Reed-Solomon 
Decoder Output 

1.5 x 10 7 

34 

2 x 10 -6 

(1 decoding failure out of 
7350 codewords) 





































Table III. Model of the TDRS East RFI environment for thermal noise EIRP of 29dBW 


EIRP 

(dBW) 

Duty 

Cycle (%) 

Total Noise w/ 
Thermal (dBW) 

EM 

Raw Channel 
BER 

Thermal 

N/A 

29 

2.6 

0.0283 

20 

10 

29.5 

2.1 

0.0359 

30 

13 

32.5 

-.9 

0.101 

40 

3 

40.3 

-8.7 

0.301 

50 

2 

50.0 

-18.4 

Single 

60 

1.8 

60.0 

-28.4 

Source 

70 

0.2 

70.0 

-38.4 

BER = 1/2 


Table IV. Model of the TDRS East RFI environment for thermal noise EIRP of 24dBW 


EIRP 

(dBW) 

Duty 

Cycle (%) 

Total Noise w/ 
Thermal (dBW) 

EM 

Raw Channel 
BER 

Thermal 

N/A 

24 

7.6 

3.47 x 10^ 

20 

10 

25.5 

6.1 

0.00216 

30 

13 

31.0 

0.6 

0.0649 


3 

40.0 

-8.5 

0.297 

50 

2 

50.0 

-18.4 

Single 

60 

1.8 

60.0 

-28.4 

Source 

70 

0.2 

70.0 

-38.4 

BER = 1/2 









































































Table V. Model of the TDRS West RFI environment for thermal noise EIRP of 29dBW 


EIRP Duty Total Noise w/ Raw Channel 


(dBW) Cycle (%) Thermal (dBW) E IN 0 BER 


Thermal 

N/A 

29 

2.6 

0.0283 

25 

3.5 

30.4 

1.1 

0.0543 

35 

2.2 

36.0 

-4.4 

0.197 

45 

0.6 

45.1 

-13.5 

0.382 

55 

1.1 

55.0 

-23.4 

Single 

65 

0.1 

65.0 

-33.4 

BER = 1/2 


Table VI. Model of the TDRS West RFI environment for thermal noise EIRP of 24dBW 


EIRP 

(dBW) 

Duty 

Cycle (%) 

Total Noise w/ 
Thermal (dBW) 

EM 

Raw Channel 
BER 

Thermal 

N/A 

24 

7.6 

3.47 x 10" 4 

25 

3.5 

27.5 

4.1 

0.0117 

35 

2.2 

35.3 

- 3.7 

0.178 

45 

0.6 

45.0 

-13.4 

0.380 

55 

1.1 

55.0 

-23.4 

Single 

65 

0.1 

65.0 

-33.4 

BER = 1/2 


































































Table VII. Performance results for the TDRS East RFI environment model with 29dBW thermal 

noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

2x 10 6 

1.31 xlO 5 

(8.99 x 10 4 in bursts) 

0.0657 

Viterbi Decoder 
Output 

1.0 x 10 6 

2.6 xlO 4 

(no node sync loss) 

0.0260 

Reed-Solomon 
Decoder Output 

1.0 x 10 6 

2.31 xlO 4 

0.0232 

(411 decoding failures out 
of 490 codewords) 


Table VIII. Performance results for the TDRS East RFI environment model with 24dBW 

thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

2 x 10 6 

7.35 x 10 4 

(7.30 x 10 4 in bursts) 

0.0367 

Viterbi Decoder 
Output 

1.0 xlO 6 

1.83 xlO 4 

(no node sync loss) 

0.0183 

Reed-Solomon 
Decoder Output 

1.0 xlO 6 

9.65 xlO 3 

0.00965 

(213 decoding failures out 
of 490 codewords) 

































Table IX. Performance results for the TDRS West RFI environment model with 29dBW thermal 

noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

2x 10 6 

8.11 xlO 4 

(2.84 x 10 4 in bursts) 

0.0406 

Viterbi Decoder 
Output 

1.0 xlO 6 

6.33 xlO 3 
(no node sync loss) 

0.00633 

Reed-Solomon 
Decoder Output 

1.0 x 10 6 

0 

0.0 

(0 decoding failures out of 
490 codewords) 


Table X. Performance results for the TDRS West RFI environment model with 24dBW thermal 

noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

2 xlO 6 

2.58 xlO 4 

(2.51 x 10 4 in bursts) 

0.0129 

Viterbi Decoder 
Output 

1.0 xlO 6 

4.59 xlO 4 
(no node sync loss) 

0.00459 

Reed-Solomon 
Decoder Output 

1.0 xlO 6 

0 

0.0 

(0 decoding failures out of 
490 codewords) 





























Model of the new TDRS East RFI environment 
for thermal noise EIRP of 29dBW (BW = 20MHz) 


EIRP 

(dBW) 

Duty Cycle 
High RFI 

Duty Cycle 
Low RFI 

Total Noise w/ 
Thermal (dBW) 


Raw Channel 
BER 

Thermal 

N/A 

N/A 

29 

2.6 

0.0283 

30 

6.5 


32.5 

-0.9 

0.101 

40 

2.5 

1.5 

40.3 

-8.7 

0.301 


3.5 

1.0 

50 

-18.4 

1/2 


Performance results for the new TDRS East High RFI environment model 
with 29dBW thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

lxlO 6 

1.11 x 10 s 

(6.15 x 10 4 in bursts) 

0.0557 

Viterbi Decoder 
Output 

5.0 xlO 5 

9.33 x 10 3 
(no node sync loss) 

0.0186 

Reed-Solomon 
Decoder Output 

5.0 x10 s 

4.50 x 10 3 

0.00900 

(95 decoding failures out 
of 245 codewords) 


Performance results for the new TDRS East Low RFI environment model 
with 29dBW thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

1 x 10 6 

8.06 xlO 4 

(2.77 x 10 4 in bursts) 

0.0403 

Viterbi Decoder 
Output 

5.0 x10 s 

2.97 xlO 3 
(no node sync loss) 

0.00594 

Reed-Solomon 
Decoder Output 

5.0 x10 s 

0 

0 

(0 decoding failures out of 
245 codewords) 



























































Model of the new TDRS East RFI environment 
for thermal noise EIRP of 23.5dBW (BW = 20MHz) 


EIRP 

(dBW) 

Duty Cycle 
High RFI 

Duty Cycle 
Low RFI 

Total Noise w / 
Thermal (dBW) 


Raw Channel 
BER 

Thermal 

N/A 

N/A 

23.5 

8.1 

1.64 x 10 -4 

30 

6.5 

4.5 

30.9 

0.7 

0.0627 

40 

2.5 

1.5 

40.1 

-8.5 

0.297 

50 

3.5 

1.0 

50 

-18.4 

1/2 


Performance results for the new TDRS East High RFI environment model 
with 23.5dBW thermal noise EERP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

1 x 10 6 

5.69 xlO 4 

(5.66 x 10 4 in bursts) 

0.0284 

Viterbi Decoder 
Output 

5.0 x10 s 

6.99 xlO 3 
(no node sync loss) 

0.0140 

Reed-Solomon 
Decoder Output 

5.0 x10 s 

1.51 xlO 3 

0.00302 

(37 decoding failures out 
of 245 codewords) 


Performance results for the new TDRS East Low RFI environment model 
with 23.5dBW thermal noise EIRP 


Simulation Simulation Total Observed Observed 

Location Length Binary Errors BER 

Hard-decision 2.49 x 10 4 

Threshold Output 1 x 10 6 (2.45 x 10 4 in bursts) 0.0124 

Viterbi Decoder 2.09 x 10 3 

Output 5.0 x 10 5 (no node sync loss) 0.00419 

Reed-Solomon 0 

Decoder Output 5.0 xlO 5 0 (0 decoding failures out of 

245 codewords) 



























































Model of the new TDRS East RFI environment 
for thermal noise EIRP of 22dBW (BW = 4MHz) 


EIRP Duty Cycle Duty Cycle Total Noise w/ Raw Channel 

(dBW) High RFI Low RFI Thermal (dBW) E /N 0 BER 


Thermal 

N/A 

N/A 

22 

9.6 

9.76 x 10' 5 

30 

6.5 

4.5 

25.5 

6.1 

0.00216 

40 

2.5 

1.5 

33.3 

-1.7 

0.122 

50 

3.5 

1.0 

43 

-11.4 

0.351 


Performance results for the new TDRS East High RFI environment model 
with 22dBW thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

1 xlO 6 

3.06 x10 s 

(3.04 x 10 4 in bursts) 

0.0153 

Viterbi Decoder 
Output 

5.0 xlO 5 

3.77 xlO 3 
(no node sync loss) 

0.00753 

Reed-Solomon 
Decoder Output 

5.0xl0 5 

0 

0 

(0 decoding failures out of 
245 codewords) 


Performance results for the new TDRS East Low RFI environment model 
with 22dBW thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

1 x 10 6 

1.10 x 10 4 

(1.10 x 10 4 in bursts) 

0.00548 

Viterbi Decoder 
Output 

5.0 x10 s 

918 

(no node sync loss) 

0.00184 

Reed-Solomon 
Decoder Output 

5.0 x10 s 

0 

0 

(0 decoding failures out of 
245 codewords) 




























































Model of the new TDRS East RFI environment 
for thermal noise EIRP of 16.5dBW (BW = 4MHz) 


EIRP 

(dBW) 

Duty Cycle 
High RFI 

Duty Cycle 
Low RFI 

Total Noise w/ 
Thermal (dBW) 

EM 

Raw Channel 
BER 

Thermal 

N/A 

N/A 

16.5 

15.1 

4.32 x 10“ 16 

30 

6.5 

mm 

23.9 

7.7 

3.01 x 10^ 

40 

2.5 

1.5 

33.1 

-1.5 

0.117 

50 

3.5 

1.0 

43 

-11.4 

0.351 


Performance results for the new TDRS East High RFI environment model 
with 16.5dBW thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

lxlO 6 

3.00 x 10 4 

(3.00 x 10 4 in bursts) 

0.0150 

Viterbi Decoder 
Output 

5.0 xlO 5 

3.37 x 10 3 
(no node sync loss) 

0.00674 

Reed-Solomon 
Decoder Output 

5.0 x10 s 

0 

0 

(0 decoding failures out of 
245 codewords) 


Performance results for the new TDRS East Low RFI environment model 
with 16.5dBW thermal noise EIRP 


Simulation 

Location 

Simulation 

Length 

Total Observed 
Binary Errors 

Observed 

BER 

Hard-decision 
Threshold Output 

1 xlO 6 

1.07 xlO 4 

(1.07 xlO 4 in bursts) 

0.00533 

Viterbi Decoder 
Output 

5.0 x10 s 

786 

(no node sync loss) 

0.00157 

Reed-Solomon 
Decoder Output 

5.0 x10 s 

0 

0 

(0 decoding failures out of 
245 codewords) 
























































